D-A123  798  COMPUTER-AIDED  DESIGN  AND  ANALVSIS  OF  DIGITAL  GUIDANCE 
AND  CONTROL  SYSTENS(U)  ADVISORY  GROUP  FOR  AEROSPACE 
RESEARCH  AND  DEVELOPMENT  NEUILLY-SUR-SEINE  (FRANCE) 
UNCLASSIFIED  JUL  82  AGARD-LS-128  F/G  17/7 


MICROCOPY  RESOLUTION  TEST  CHART 

NATIONAL  BURE.AU  OF  ST  AN  DARDS  - 1 963  -  A 


AGARD-LS-128 


e 

AGARD-LS-128 


D 


r 


ADVISORY  GROUP  FOR  AEROSPACE  RESEARCH  &  DEVELOPMENT 


7  RUE  ANCELLE  92200  NEUILLY  SUR  SEINE  FRANCE 


AGARD  LECTURE  SERIES  No.  128 

Computer-Aided  Design  and 
Analysis  of  Digital  Guidance 
and  Control  Systems 

r-~r'.'Z 

C‘-  - 

«VCT7 91983 

distribution  is  unlimited.  J  ^  ^ 


NORTH  ATLANTIC  TREATY  ORGANIZATION 


DISTRIBUTION  AND  AVAILABILITY 
ON  BACK  COVER 


I 


AGARD-LS-128 


NORTH  ATLANTIC  TREATY  ORGANIZATION 
ADVISORY  GROUP  FOR  AEROSPACE  RESEARCH  AND  DEVELOPMENT 
(ORGANISATION  DU  TRAITE  DE  L’ATLANTIQUE  NORD) 


AGARD  Lecture  Series  No.l  28 

COMPUTER-AIDED  DES'CN  AND  ANALYSIS  OF  DIGITAL 
GUIDANCE  AND  CONTROL  SYSTEMS 


|  The  material  in  this  publication  was  assembled  to  support  a  Lecture  Series  under  the  sponsorship 

of  the  Guidance  and  Control  Panel  and  the  Consultant  and  Exchange  Programme  of  AGARD 
I  on  5-6  September  1983  in  Stuttgart,  Germany;  8-9  September  1983  in  Athens,  Greece;  and 

12-13  September  1 983  in  Paris,  France. 

L 


THE  MISSION  OF  AGARD 


The  mission  of  AGARD  is  to  bring  together  the  leading  personalities  of  the  NATO  nations  in  the  fields  of  science 
and  technology  relating  to  aerospace  for  the  following  purposes: 

-  Exchanging  of  scientific  and  technical  information; 

-  Continuously  stimulating  advances  in  the  aerospace  sciences  relevant  to  strengthening  the  common  defence 
posture; 

-  Improving  the  co-operation  among  member  nations  in  aerospace  research  and  development; 

-  Providing  scientific  and  technical  advice  and  assistance  to  the  North  Atlantic  Military  Committee  in  the  field 
of  aerospace  research  and  development; 

-  Rendering  scientific  and  technical  assistance,  as  requested,  to  other  NATO  bodies  and  to  member  nations  in 
connection  with  research  and  development  problems  in  the  aerospace  field; 

-  Providing  assistance  to  member  nations  for  the  purpose  of  increasing  their  scientific  and  technical  potential; 

-  Recommending  effective  ways  for  the  member  nations  to  use  their  research  and  development  capabilities  for 
the  common  benefit  of  the  NATO  community. 

The  highest  authority  within  AGARD  is  the  National  Delegates  Board  consisting  of  officially  appointed  senior 
representatives  from  each  member  nation.  The  mission  of  AGARD  is  carried  out  through  the  Panels  which  are 
composed  of  experts  appointed  by  the  National  Delegates,  the  Consultant  and  Exchange  Programme  and  the  Aerospace 
Applications  Studies  Programme.  The  results  of  AGARD  work  are  reported  to  the  member  nations  and  the  NATO 
Authorities  through  the  AGARD  series  of  publications  of  which  this  is  one. 

Participation  in  AGARD  activities  is  by  invitation  only  and  is  normally  limited  to  citizens  of  the  NATO  nations. 


The  content  of  this  publication  has  been  reproduced 
directly  from  material  supplied  by  AGARD  or  the  authors. 


Published  July  1983 

Copyright  ©  AGARD  1983 
All  Rights  Reserved 

ISBN  92-835-1455-6 


Printed  by  Specialised  Printing  Services  Limited 
40  Chigwell  Lane,  Loughton,  Essex  IG10  3TZ 


PREFACE 


This  Lecture  Series  is  intended  to  provide  the  basic  concepts,  theories  and  computer 
methods  involved  in  the  design  of  advanced  guidance  and  control  systems  and  is  sponsored 
by  the  Guidance  and  Control  Panel. 

The  degree  of  advantages  in  the  application  of  modem  microprocessor  technologies  is 
already  largely  affected  by  the  way  corresponding  systems  are  designed  in  the  very  early 
stage  of  a  development  programme. 

Is  intended  to  perform  a  comprehensive  review  of  direct  digital  analysis  and  synthesis 
procedures  and  to  include  in  this  Lecture  Series  computer-aided  and  graphical  techniques 
that  can  be  employed  in  preliminary  design,  synthesis  and  real-time  simulation. 

This  Lecture  Series  has  been  implemented  by  the  Consultant  and  Exchange 
Programme. 


LIST  OF  SPEAKERS 


Lecture  Series  Director:  DrJ.E.Wall 

Systems  and  Research  Center 
Honeywell  Inc. 

2600  Ridgeway  Parkway 
Minneapolis,  Minnesota  55413 
USA 


SPEAKERS 

Professor  G.F. Franklin 
Department  of  Electrical  Engineering 
Stanford  University 
1 3 1  Durand  Building 
Stanford,  California  94305 
USA 

Dr  G.Griibei 
D.F.V.L.R. 

Institute  for  Flight  System  Dynamics 
8031  Oberpfaffenhofen 
Germany 

Surrey 

UK 


Professor  K.J.Astrdm 
Department  of  Automatic  Control 
Lund  Institute  of  Technology 
Box  725 

S-220  07  Lund  7 
Sweden 

Dr  M.J. Denham 
Kingston  Polytechnic 
School  of  Electronic  Engineering 
and  Computer  Science 
Penrhyn  Road 

Kingston  upon  Thames  KT1  2EE 


Professor  A.J.Laub 
Department  of  Electrical  Engineering 
Systems 

University  of  Southern  California 
Los  Angeles,  California  90089 
USA 


CONTENTS 


Page 

PREFACE  iii 

UST  OF  SPEAKERS  iv 

Reference 

INTRODUCTION  AND  OVERVIEW* 

by  J.E.WaO  1 

FUNDAMENTALS  OF  ANALYSIS  FOR  DIGITAL  CONTROL  SYSTEMS 

by  G.F.Franklin  2 

DESIGN  ENVIRONMENTS  AND  THE  USER  INTERFACE  FOR  CAD  OF 
CONTROL  SYSTEMS 

by  M.J. Denham  3 


MODELING  AND  SIMULATION  TECHNIQUES 

by  K.J.Astrdm  4 

NUMERICAL  ASPECTS  OF  CONTROL  DESIGN  COMPUTATIONS 

by  A.J.Laub  5 

PERFORMANCE  AND  ROBUSTNESS  ASPECTS  OF  DIGITAL  CONTROL 
SYSTEMS 

by  J.E.Wall,  J.C.Doyle,  G.L.Hartmann,  N.A.Lehtomaki  and  G.Stein  6 

DIRECT  DIGITAL  DESIGN  VIA  POLE  PLACEMENT  TECHNIQUES 

by  G.F.Franklin  7 

SYSTEMATIC  COMPUTER-AIDED  CONTROL  DESIGN 

by  G.GrQbet  8 

PRACTICAL  ASPECTS  ON  DIGITAL  IMPLEMENTATION  OF  CONTROL 
LAWS 

by  K.J.Astrom  9 


•Paper  not  arailable  at  time  of  printing 


I 

I 

i 
i 

t 

•  FUNDAMENTALS  OF  ANALYSIS  FOR  DIGITAL  CONTROL  SYSTEMS 


G.  F.  Franklin 

Department  of  Electrical  Engineering 
Stanford  University 
Stanford,  California  94305  USA 


2.1  SUMMARY 

The  Intent  of  this  presentation  is  to  provide  the  theoretical  background  and  practical  tools  for  the 
design  of  a  control  system  which  is  to  be  implemented  using  a  computer  or  microprocessor.  The  methods  to 
be  studied  are  primarily  for  closed-loop  (feedback)  systems  in  which  the  dynamic  response  of  the  process 
being  controlled  is  a  major  consideration  in  the  design.  The  design  methods  are  applicable  to  any  type  of 
computer  (from  microprocessors  to  large  scale  computers);  however,  the  effects  of  small  word  size  and  slow 
sample  rates  take  on  a  more  important  role  when  using  microprocessors. 

It  will  be  assumed  in  the  presentation  that  the  reader  has  some  knowledge  of  control  system  design 
methods  for  continuous  (or  analog)  systems  such  as  those  covered  in  the  textbooks  by  Dorf  [1]  or  Ogata  (2]. 
Furthermore,  a  more  complete  reference  for  the  subject  material  can  be  found  in  a  digital  control  textbook 
by  Franklin  and  Powell  [3]. 

A  typical  topology  of  the  type  of  system  to  be  considered  is  shown  in  Fig.  2.1.  There  are  two  funda¬ 
mentally  different  methods  for  the  design  of  digital  algorittas: 

(1)  Continuous  Design  and  digitization:  perform  a  continuous  design,  then  digitize  the  resulting 
compensation, 

(2)  Direct  Digital  Design:  digitize  the  plant  model,  then  perform  a  design  using  discrete  analysis 
methods. 

Both  methods  will  be  covered  and  their  advantages  and  disadvantages  discussed. 


r  “  reference  or  command  input 
y  “  output  quantities 
u  *  actuator  input  signals 
A/D  “  analog-to-digital  converter 
D/A  -  digital-to-analog  converter 


Figure  2.1:  Basic  Control  System  Block  Diagram 


2.2  THEORETICAL  BACKGROUND 
(a)  z  -Transform 

In  the  analysis  of  continuous  systems,  we  use  the  Laplace  Transform  which  is  defined  by: 


L{f(t)} 


F(s) 


f  (t)e 


-st 


dt 


which  leads  directly  to  the  important  property  that 


(2.1) 


L{f ( t )}  «  sF(s)  .  (2.2) 

This  relation  enables  us  to  easily  find  the  transfer  function  of  a  linear  continuous  system  given  the 
differential  equations  of  that  system. 

For  discrete  systems,  a  very  similar  procedure  is  available.  The  "z-transf orm"  is  defined  by 


CO 

Z{f(n)}  »  F(z)  ■  l  f  (n)z_n 

n”-°° 


(2.3) 


which  also  leads  directly  to  a  property  analogous  to  Eq.  (2.2),  specifically  that 


This  relation  allows  us  to  easily  find  the  transfer  function  of  a  discrete  system  given  the  difference 
equations  of  that  system.  For  example,  the  general  2nd  order  difference  equation, 

y(n)  “  -a^yCn-l)  -  a2y(n-2)  +  bQu(n)  +  b^ufn-l)  +  b2u(n-2)  (2.5) 

can  be  converted  from  y(n),  u(n),  etc.,  to  the  z-transform  of  those  variables  by  invoking  Eq.  (2.4)  once 
or  twice  to  arrive  at, 

Y(z)  =  ( -a1z-1-a2z_2)Y(z )  +  (bp+b^^z-2)^ )  (2.6) 

which  results  in  the  transfer  function 

Y(z)  b.  +  b.z-1  +  b.z-2 

-  -  - - ^7 - "T-T  (2‘7) 

ll(z)  1  +  a^z  x  +  a2z 

(b)  z-Transform  Inversion 

A  table  relating  a  few  simple  discrete  time  functions  to  their  z-transform  is  contained  in  Table  2.1 
along  with  the  Laplace  transform  for  the  same  time  functions. 
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J(s)  is  the  Laplace  transform  of  f(t)  and  F(z)  is  the  z-transform  of 
f(nT).  Unless  otherwise  noted,  f(t)  =  0,  t  <  0. 

Table  2. It  z -Transforms 

Given  a  general  z-transform,  one  can  expand  it  into  a  sum  of  elementary  terms  using  partial  fraction 
expansion  and  find  the  resulting  time  series  from  the  table.  Again,  these  procedures  are  exactly  the  same 
as  those  used  for  continuous  systems. 

A  z-transform  inversion  technique  which  has  no  continuous  counterpart  is  long  division.  Given  a  z- 
transform. 


one  8 imply  divides  the  denominator  into  the  numerator  using  long  division.  The  result  is  a  series  (perhaps 
infinite)  in  powers  of  z_1  ,  from  which  the  time  series  can  be  found  by  using  Eq.  (2.3). 

For  example,  a  first  order  system  described  by  the  difference  equations, 

y(n)  »  ky(n-l)  +  u(n)  (2.9) 

yields 

Ilii  »  - ^ -  (2.10) 


for  an  impulsive  input. 


*  U(z)  •  1 
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(2.11) 


Therefore,  to  find  the  time  series,  use  long  division  methods  as  follows: 


-1  2-2  3  -3 

1+kz  +lc  z  z+k  z  +  ... 


1-kz'1  /T 


.  "I  1.2-2 

kz  -k  z 

k2z-2+0 

k2z'2-k3z_3 


to  yield  the  infinite  series: 


,  .  ,  -1  x  ,2  -2  t  ,-3  -3  , 

1  +  kz  +kz  +k  z  +... 


The  quotient,  1  +  kz  3  +  k2z  2  +  k3z  3  +  ...  is  Y(z)  which  means  that, 

y(0)  =  1 

y(l)  =  k 
y(2)  -  k2 


y(n)  =  kn 


(c)  Relationship  Between  s  and  z 


For  continuous  systems,  one  often  associates  certain  behavior  for  different  pole  locations  in  the 
s-plane:  oscillatory  behavior  for  poles  near  the  imaginary  axis,  exponential  decay  for  poles  on  the  negative 
real  axis,  and  unstable  behavior  for  poles  with  a  positive  real  part.  The  same  kind  of  association  is  also 
useful  to  designers  of  discrete  systems.  The  equivalent  characteristics  in  the  z-plane  are  related  to  those 
in  the  s-plane  by  the  expression 


where  T  «  sample  period.  This  is  obtained  by  comparing  the  z-transform  of  the  sampled  version  of  a  signal 
with  the  Laplace  transform  of  the  signal  itself.  The  z-transform  Table  2.1  also  includes  the  Laplace  trans¬ 
forms,  which  demonstrates  the  z  »  e8^  relationship  in  the  denominators  of  all  the  table  entries. 

Figure  2.2  shows  the  mapping  of  lines  of  constant  damping,  X,  ,  and  natural  frequency,  u>  ,  from  the 
s-plane  to  the  upper  half  of  the  z-plane  using  Eq.  (2,12).  The  mapping  has  several  important  features: 

(1)  The  stability  boundary  is  the  unit  circle,  |z|  “1. 

(2)  The  small  vicinity  around  z  •  +1  is  essentially  identical  to  the  vicinity  around  s  -  0. 

(3)  z-plane  locations  give  response  information  normalized  to  the  sample  rate,  rather  than  with 
respect  to  time  as  in  the  s-plane. 

(4)  The  negative  real  z  axis  always  represents  a  frequency  of  “s/2»  where  cog  »  sample  rate. 

(5)  Vertical  lines  in  the  left-hand  s-plane  (constant  real  part  or  time  constant)  map  into  circles 
within  the  unit  circle. 

(6)  Horizontal  lines  in  the  s-plane  (constant  imaginary  part  or  frequency)  map  into  radial  lines 
in  the  z-plane. 

(7)  Every  location  in  the  z-plane  represents  many  frequencies  separated  by  U)-.  Physically,  this  is 
because  many  different  signals  can  have  the  same  samples  and  mathematically  because  of  the  nature 
of  the  exponential  function  in  Eq.  (2.12). 


(d)  Final  Value  Theorem 

The  final  value  theorem  for  continuous  systems 


lim  sX(s) 
et*0 


(2.13) 


t  4 


\! 


2-4 


Figure  2.2:  Natural  Frequency  and  Damping  Loci  in  z-Plane 

is  often  used  to  find  steady  state  system  errors  and/or  steady  state  gains  of  portions  of  a  control  system. 
The  analog  for  discrete  systems  is  obtained  by  noting  that  a  continuous  constant  steady  response  has  the 
transform  X(s)  =  A/s  and  leads  to  the  multiplication  by  s  in  Eq.  (2.13).  Therefore,  since  a  constant 
steady  response  for  discrete  systems  is  X(z)  =  A/l-z-1  ,  the  discrete  final  value  theorem  is: 

x(n)  -  lim  (1  -  z_1)X(z)  (2.14) 

n-*»  z-*T 

X( z )  • 58 ( 1+z ) 

For  example,  to  find  the  DC  gain  of  the  transfer  function,  G(z)  »  y' .  =  —£  + — j-jr  ,  let  u(n)  =  1  for 
n  ^  0,  so  that  '  0 


and 


X(z) 


. 58 ( 1+z ) 
(1-z  1)(z+.16) 


Applying  the  final  value  theorem  yields 

x(“) 


lim 

z-*-l 


.58(1  ±  z) 
z  +  .16 


1 


and  therefore,  the  DC  gain  of  G(z)  is  unity.  In  general,  we  see  that  to  find  the  DC  gain  of  any  transfer 
function,  simply  substitute  z  =  1  and  compute  the  resulting  gain. 

Since  the  gain  of  a  system  does  not  change  whether  represented  continuously  or  discretely,  this  calcu¬ 
lation  is  an  excellent  check  on  the  calculations  associated  with  determining  the  discrete  model  of  a  system 


2.3  CONTINUOUS  DESIGN 


The  first  part  of  this  design  procedure  should  be  already  familiar  to  the  reader;  that  is,  the  design 
of  feedback  control  compensation  for  a  continuous  system.  This  design  is  carried  out  as  if  the  system  were 
continuous  and  no  changes  are  required  to  represent  the  fact  that  the  control  will  eventually  be  implemented 
digitally. 

(a)  Digit  itlon  Proce^  es 

The  second  pa, r  f  t.  procedure  is  to  digitize  the  resulting  compensation.  Therefore,  the  problem  to 
be  addressed  is:  j  en  a  D(s),  find  the  best  equivalent  D(z).  Or  more  exactly,  given  a  D(s)  from  the 
control  system  shown  in  Fig.  2.3,  find  the  best  digital  implementation  of  that  compensation.  A  digital 
implementation  requires  that  y  is  sampled  at  some  sample  rate  and  that  the  computer  output  samples  are 
smoothed  in  some  manner  so  as  to  provide  a  continuous  u  .  For  ease  of  hardware  design,  the  smoothing 
operation  is  almost  always  e  simple  hold  (or  zero  order  hold,  "ZOH" )  which  is  shown  in  Fig.  2.4 


i 

V 

vy 
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Therefore,  we  can  restate  the  problem  as:  Find  the  best  D(z)  in  the  digital  implementation  shown 
in  Fig,  2.5  to  match  a  desired  D(s).  It  is  important  to  note  at  the  outset  that  there  is  no  exact  solution 


Compensation 


Figure  2.3:  Continuous  Control  System 


Figure  2.4:  Zero  Order  Hold 


Figure  2.5:  Digital  Compensation  Implementation 


to  this  problem  because  D(s)  responds  to  the  complete  time  history  of  x(t)  whereas  D(z)  only  has 
access  to  the  samples,  x(n).  In  a  sense,  the  various  digitization  approximations  (and  approximations  they 
are)  simply  make  different  assumptions  about  what  happens  to  x(t)  between  the  sample  points. 

Tustin's  Method:  One  digitization  method  is  to  approach  the  problem  as  one  of  numerical  integration. 
Suppose 


U(s)  .  .  1 

y  a  D(s)  “  —  ,  i.e.,  pure  integration 


Therefore, 


x(t)  dt  + 1  x(t)  dt 

JriT~T 


(2.15) 


“  u(nT-T)  +  (area  under  x(t)  over  last  T) 

where  T  »  sample  period,  u(nT)  is  usually  written  u(n)  for  short  and  the  task  at  each  step  is  to  use 
trapezoidal  integration,  i.e.,  approximate  x(t)  by  a  straight  line  between  the  two  samples  (Fig.  2.6). 
Therefore  Eq.  (2.15)  becomes: 


or  taking  the  z-transform. 


u(nT-T)  +  4-  x(nT-T)  +  x(nT) 


(2.16) 


T  1  +  z 


(2.17) 


nT-T  nT 


Figure  2.6:  Trapezoidal  Integration 


Application  of  the  same  integration  approximation  yields 


and,  in  fact,  the  substitution 


s  :  =  -  1  ~  2  (2.18) 

T  1  +  z 

In  any  D(s)  yields  a  D(z)  based  on  the  trapezoidal  integration  formula.  This  is  called  Tust in* s  or 
the  Bilinear  approximation. 

Matched  Pole  Zero  Method  (MP2):  Another  digitization  method,  called  the  "matched  pole-zero"  is  found 
by  extrapolation  of  the  relation  between  the  s  and  z  planes  stated  in  Eq.  (2.12).  If  we  take  the 

z-transform  of  a  sampled  x(t)  ,  then  the  poles  of  X(z)  are  related  to  the  poles  of  X(s)  according  to 

z  *  esT.  However,  we  must  go  through  the  z-transform  process  to  locate  the  zeros  of  X(z).  The  idea  of 
the  matched  pole-zero  technique  is  to  apply  z  =  es^  to  the  poles  and  zeros  of  a  transfer  function.  Since 
physical  systems  often  have  more  poles  than  zeros,  it  is  also  useful  to  arbitrarily  add  zeros  of  D(z)  at 

z  *  -1  (i.e.,  a  (1  +  z-1)  term)  which  causes  an  averaging  of  the  current  and  past  input  values  as  in  the 

trapezoidal  integration  (Tustin*s)  method.  The  gain  is  selected  so  that  the  low  frequency  gain  of  D(s) 
and  D(z)  match  one  another.  The  method  summarized  is: 

(1)  Map  poles  and  zeros  according  to  z  -  es^. 

—1  —1  2 

(2)  Add  (1  +  z  )  or  (1  +  z  )  ,  etc.,  if  numerator  is  lower  order  than  the  denominator. 

(3)  Match  DC  or  low  frequency  gain. 

For  example,  the  matched  pole-zero  approximation  of 


is  D(z) 


(2.19) 


,  .  a  1-e 

where  k  =  —  - - 

b  1-e 


D(z)  = 


(z+l)(z-e  aT) 
( z-1 ) (z-e_bT ) 


(2.20) 


where  k  =>  —  ^ — — 

2b  l-e~aT 

In  both  digitization  methods,  the  fact  that  an  equal  power  of  z  appears  in  numerator  and  denominator 
of  D(z)  Implies  that  the  difference  equation  at  time  n  will  require  a  sample  of  the  input  at  time  n. 
For  example,  the  D(z)  in  Eq.  (2.19)  can  be  written 

=  D(z)  =  k  1-012  | 

X(z )  l-Bz-i 

which  results  in  the  difference  equation, 

u(n)  =  Su(n-l)  +  k[x(n)  -  ax(n-L)]  .  (2.21) 

Modified  Matched  Pole  Zero  Method  (MMPZ):  The  D(z)  in  Eq.  (2.20)  would  also  result  in  u(n)  being 
dependent  on  x(n) ,  the  input  at  the  same  time  point.  If  the  structure  of  the  computer  hardware  prohibits 
this  relation,  or  if  the  computations  are  particularly  lengthy  thus  rendering  Eq.  (2.21)  impossible  to 
implement,  it  may  be  desirable  to  arrive  at  a  D(z)  which  has  one  less  power  of  z  in  the  numerator  than 
denominator  and  hence  the  computer  output,  u(n),  only  requires  input  from  the  previous  time,  i.e.  x(n-l) 
To  do  this,  we  modify  step  (2)  in  the  "matched  pole-zero"  procedure  to  add  one  less  zero  at  z  =  -1  , 
leaving  a  single  zero  at  z  =  «°.  The  second  example 


D(s)  = 


s  +  a 
s(s+b) 


and  then  become 


D(  z )  =  k 


( z  -1 )  ( z  -e  ) 


where  k  = 


and  which  results  in 


(1+e  ',1)u(n-l)-e  ^u(n-2)  +  k|x(n-l)-e  alx(n-2)]  . 


Method  Comparison:  A  numerical  comparison  of  a  magnitude  frequency  response  is  made  in  Fig.  2.7  for 
the  three  approximation  techniques  at  two  sample  rates.  The  results  of  the  D(z)  computations  used  in 
arriving  at  Fig.  2.7  are  shown  in  Table  2.2. 


The  figure  shows  that  all  the  approximations  are  quite  good  at  frequencies  below  about  1/4  the  sampl 
rate,  u>  / 4.  If  oj  /4  is  sufficiently  larger  than  the  filter  break  frequency,  i.e.,  if  the  sampling  is 
fast  enough,  the  break  characteristics  are  accurately  reproduced.  Tustin's  and  the  MPZ  show  a  notch  at 


:-7 


!  D<z) I  |D(z) | 


Figure  2.7:  Comparison  of  Discrete  Approximations 


D(z)  for  D(s)  =  — 7— r 

S  +  !> 

oj  =  15  Hz 
s 

u)  =  3  Hz 
s 

Matched  Pole-Zero  (MPZ) 

.143  Z 

z-,715 

•405  z-tl89 

Modified  MPZ  (MMPZ ) 

•285  z-,715 

,8U  z-,189 

Tustin's 

.143  z  +  1 
z-,713 

454  z  +  ^ 

•  Z-.0914 

Table  2.2:  Digital  Approximations 

a)  /2  due  to  their  zero  term,  (z+1).  Other  than  the  large  difference  at  ws/2  which  is  typically  outside 
tie  range  of  Interest,  the  three  methods  have  similar  accuracies.  Since  the  MPZ  techniques  require  much 
simpler  algebra  than  TueMp's,  they  are  typically  preferred,  although  it  should  be  pointed  out  that  while 
the  MPZ  method  requires  knowledge  of  the  pole  and  zero  locations,  Tustin's  method  does  not  require  that 
the  polynomials  be  factored. 

(b)  Design  Example 
2 

For  a  1/s  plant,  we  wish  to  design  a  digital  controller  to  have  a  closed-loop  natural  frequency, 
u>  ,  of  -0.3  rad/sec  and  £  =  0.7.  The  first  step  is  to  find  the  proper  D(s)  defined  in  Fig.  2.8.  The 
specifications  can  be  met  with 


D(s)  =  k  (2.22) 

s  +  b 

where 

a  =  0.2 

b  =  2.0 

k  =  0.81 

as  can  be  verified  by  the  root  locus  in  Fig.  2.9.  To  digitize  this  D(s),  we  first  need  to  select  a 
sample  rate.  For  a  system  with  u)  ■  0.3  rad/sec,  a  very  "safe"  sample  rate  would  be  a  factor  of  20  faster 
than  u)n,  yielding 

w  «•  0.3  »  20  =  6  rad/sec  . 

s 


Figure  2.8:  Continuous  Design  Statement 
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Figure  2.9:  s-Plane  Locus  vs.  k  Figure  2.10:  Digital  Control  System 


Thus,  let  us  pick  T  =  1  sec.  The  matched  Pole-Zero  digitization  of  Eq.  (2.22)  is  given  by  Eq.  (2.19)  and 
yields 

_  _  A  AA 

D(z )  =  (0.389)  z  _~Q^35  (2.23) 


D(z)  =  0.389  -  0.319  z 

1  -  0.135  z'1 


which  leads  to 


u(n)  =  0.13 5u(n-l)  +  0.389e(n)  -  0.319e(n-l) 


(2.24) 


e(n)  -  r(n)  -  y(n) 

and  completes  the  digital  algorithm  design.  The  complete  digital  system  is  shown  in  Fig.  2.10. 

(c)  Applicability  Limits  of  Method 

If  an  exact  discrete  analysis  or  a  simulation  of  the  system  was  performed  and  the  digitization  was 
determined  for  a  wide  range  of  rates,  the  system  would  be  unstable  for  sample  rates  slower  than  approxi¬ 
mately  5  *  a>n  and  the  damping  would  be  substantially  degraded  for  sample  rates  slower  than  10  xw  .  At 
sample  rates  on  the  order  of  20  * wn  (or  20 x  bandwidth  for  more  complex  systems),  this  design  meBhod  can 
be  used  with  confidence. 

Basically,  the  errors  come  about  because  the  technique  ignores  the  lagging  effect  of  the  ZOH.  An 
approximate  method  to  account  for  this  is  to  assume  that  the  transfer  function  of  the  ZOH  is 


gzoh(s) 


(2.25) 


This  is  based  on  the  idea  that,  on  the  average,  the  hold  delays  by  T/2  and  the  above  is  a  first  order  lag 
with  a  time  constant  of  T/2,  DC  gain  ■  1.  We  could  therefore  patch-up  the  original  D(s)  design  by 
inserting  this  Gzqh(s)  in  the  original  plant  model  and  finding  the  D(s)  that  yields  satisfactory  response. 

One  of  the  advantages  of  using  this  design  method,  however,  is  that  the  sample  rate  need  not  be 
selected  until  after  the  basic  feedback  design  is  completed.  Therefore  the  patch-up  eliminates  this 
advantage,  although  it  does  partially  alleviate  the  approximate  nature  of  the  method,  which  is  the  primary 
disadvantage. 

2.4  DISCRETE  DESIGN 

(a)  Analysis  Tools 


The  first  step  in  performing  a  control  design  or  analysis  of  a  system  with  some  discrete  elements  in 
it  is  to  find  the  discrete  transfer  function  of  the  continuous  portion.  For  a  system  similar  to  that  shown 
in  Fig.  2.1,  we  wish  to  find  the  transfer  function  between  u(n)  and  y(n).  Unlike  the  previous  section, 
there  is  an  exact  discrete  equivalent  for  this  system  because  the  ZOH  precisely  describes  what  hapi ens 
between  samples  and  the  output,  y(n),  is  only  dependent  on  the  input  at  the  sample  times,  u(n). 

For  a  plant  described  by  a  G(s)  and  preceded  by  a  ZOH,  the  discrete  transfer  function  is: 

G(z )  -  ( 1  -  z-1  )Z  |  J  (2. 24) 

(  s  ) 


Figure  2.11: 

(a)  Mixed  Control  System  (b)  Pure  Discrete  Equivalent 

where  Z  F(s)  means  the  z-transform  of  the  time  series  whose  Laplace  transform  is  F(s),  i.e.,  the  same 
line  in  Table  2.1.  The  formula  has  the  term  G(s)/s  because  the  control  comes  in  as  a  step  input  during 
each  sample  period.  The  term  (1-z-1)  is  there  because  a  one-sample  duration  step  can  be  thought  of  at.  an 
infinite  duration  step  followed  by  a  negative  step  one  cycle  delayed.  This  formula  (Eq.  (2.26))  allows  us 
to  replace  the  mixed  (continuous  and  discrete)  system  shown  in  Fig.  2.11a  with  the  pure  discrete  equivalent 
system  shown  in  Fig.  2.11b. 

The  analysis  and  design  of  discrete  systems  is  very  similar  to  continuous  ones;  in  fact,  all  the  same 
rules  apply.  The  closed-loop  transfer  function  of  Fig.  2.11b  is  obtained  using  the  same  rules  of  block 
diagram  reduction,  i.e.. 


Y(z)  _  DG 
RTzT  =  1  +  DG 


(2.27) 


Since  we  would  like  to  find  the  characteristic  behavior  of  the  closed-loop  system,  we  wish  to  find  the 
factors  of  the  denominator  of  Eq.  (2.27),  i.e.,  find  the  roots  of  the  characteristic  equation: 

1  +  D(z)G(z)  =  0  (2.28) 

The  root  locus  techniques  used  in  continuous  systems  to  find  roots  of  a  polynomial  in  s  apply  equally 
well  here  for  the  polynomial  in  z.  The  rules  apply  directly  without  modification;  however,  the  interpre¬ 
tation  of  the  results  is  quite  different  as  we  saw  in  Fig.  2.2.  A  major  difference  is  that  the  stability 
boundary  is  now  the  unit  circle  instea  of  the  imaginary  axis. 

A  simple  example  of  the  discrete  design  tools  discussed  so  far  should  help  fix  ideas.  Suppose  G(s) 
in  Fig.  2.11a  is: 


It  follows  from  Eq.  (2.26)  that 

G(z  ) 


G(z) 


G(s) 


+  a 


(1  -  z  X)Z 


sTs+aT 


(1  -  (l-e~aT)z~1 

[(l-z”^)(l-e-aTz-1) 


(1  -  cQ 
z  -  a 


(2.29) 


where 


To  analyze  the  performance  of  a  closed-loop  proportional  control  law,  i.e.,  D(z)  =  k,  we  use  standard 
root  locus  rules.  The  result  is  shown  in  Fig.  2.12a  and  for  comparison,  the  root  locus  for  a  continuous 
controller  is  shown  in  Fig.  2.12b.  In  contrast  to  the  continuous  case  which  remains  stable  for  all  values 
of  k,  the  discrete  case  becomes  oscillatory  with  a  decreasing  damping  ratio  as  z  goes  from  0  to  -1  and 
eventually  becomes  unstable.  This  instability  is  due  to  the  lagging  effect  of  the  ZOH  which  is  properly 
accounted  for  in  the  discrete  analysis. 


(a)  z-Plane  Root  Locus 


Figure  2:12 


(b)  s-Plane  Root  Locus 


2-10 


(b)  Feedback  Properties 

In  continuous  systems,  we  typically  start  the  design  process  by  using  proportional,  derivative,  or 
integral  control  laws  or  combinations  of  these,  sometimes  with  a  lag  included.  The  same  ideas  are  used 
in  discrete  designs  directly,  or  perhaps  the  D(z)  that  results  from  the  digitization  of  a  continuously 
designed  D(s)  is  used  as  a  starting  point. 

The  discrete  control  laws  are: 

Proportional : 


u(n) 
=>  D(z ) 


kpe(n) 


(2.30) 


Derivative: 


Integral: 


u(n) 

=  kpfefn)  -  e(n-l)] 

“>  D(n) 

=  VL-Z_1)  -  V 

u(n) 

=  u(n-l)  +  k^e(n) 

=>  D(z ) 

kI  _  kI2 

1-z-1  z-1 

(2.32) 


(c)  Design  Example 

2 

For  an  example,  let  us  use  the  same  problem  as  we  used  for  the  continuous  design;  the  1/s  plant. 
Using  Eq.  (2.26),  we  have 


G(z)  -  - — 


2  (z  -  1)‘ 


(2.33) 


which  becomes  with  T  ■»  1  sec. 


1  z  +  1 

2  (z  -  l)2 


(2.34) 


Proportional  feedback  in  the  continuous  case  yields  pure  oscillatory  motion  and  in  the  discrete  case,  we 
should  expect  even  worse  results.  The  root  locus  in  Fig.  2.13  verifies  this.  For  very  low  values  of  k 
(very  low  frequencies  compared  to  the  sample  rate)  the  locus  is  tangent  to  the  unit  circle  (C  =  0  and 
pure  oscillatory  motion)  thus  matching  the  proportional  continuous  design.  For  higher  values  of  k,  the 
locus  diverges  into  the  unstable  region  due  to  the  effect  of  the  ZOH  and  sampling. 


Figure  2.13:  z-Plane  for  1/s  Plant 

To  compensate  for  this,  let  us  add  a  velocity  term  to  the  control  law,  or 


u(z)  =  k  1  +  Y(l-z  )  e(z) 


which  yields 


D(z)  -  k(l  +  Y) 


Figure  2.14:  Compensated  z-Plane  Locus  for  l/s^  Example 


Design  Comparison 


The  controller  designed  using  pure  discrete  methods,  Eq.  (2.38),  basically  only  differs  from  the  con¬ 
tinuously  designed  controller,  Eq.  (2.24),  by  the  absence  of  the  u(n-l)  term.  The  u(n-l)  term  in 
Eq.  (11.24)  resulted  from  the  lag  term,  (s+b),  in  the  compensation,  Eq.  (2.22),  which  is  typically  included 
in  analog  controllers  because  of  the  difficulty  in  building  pure  analog  differentiators  and  for  noise  attenu¬ 
ation.  Some  equivalent  lag  in  discrete  design  naturally  appears  as  a  pole  at  z  =  0  (see  Fig.  2.14)  and 
represents  the  one  sample  delay  in  computing  the  derivative  by  a  first  difference.  For  more  noise  attenu¬ 
ation,  the  pole  could  be  moved  to  the  right  of  z  =  0,  thus  resulting  in  less  derivative  action  and  more 
smoothing,  the  same  tradeoff  that  exists  in  continuous  control  design. 

Other  than  the  u(n-l)  term,  the  two  controllers  are  very  similar  (Eqs.  (2.24)  and  (2.38)),  This 
similarity  resulted  because  the  sample  rate  is  fairly  fast  compared  to  u>  ,  i.e.,  to  -  20  x  to  .  For 
designs  at  slower  sample  rates,  the  numerical  values  in  the  compensationsnwould  becoml  increasSngly  different 
as  the  sample  rate  decreased.  For  the  discrete  design,  the  actual  system  response  would  follow  that  indicated 
by  the  z-plane  root  locations,  while  the  continuously  designed  system  response  would  diverge  from  that  indi¬ 
cated  by  the  s-plane  root  locations. 

As  a  general  rule,  discrete  design  should  be  used  if  sampling  slower  than  10  x  <o  .  At  the  very  least, 
a  continuous  design  with  slow  sampling  (co  <  10cun)  should  be  verified  by  a  discrete  analysis  or  simulation 
and  the  compensation  adjusted  if  needed.  A  simulation  of  a  digital  control  system  is  a  good  idea  in  any  case. 
If  it  properly  accounts  for  all  delays  and  possibly  asynchronous  behavior  of  different  digital  modules,  it 
may  expose  instabilities  that  are  Impossible  to  detect  using  continuous  or  discrete  linear  analysis. 

2.5  STATE  VARIABLE  REPRESENTATIONS 

An  alternative  to  models  hased  on  the  Laplace  and  z-transform  is  given  by  the  differential  equations  of 
motion  in  normal  form,  known  as  state  space  models.  A  continuous  model  in  state  form  is  written 


Fx  +  Gu 


y 


Hx  +  Ju 


(2.39) 


m: 

The  n  components  of  the  column  matrix  x  comprise  the  states,  the  m  components  of  the  matrix  u  are 
the  controls  and  y  Is  the  p*  1  output  matrix.  The  elements  of  the  matrices  (F,G,H,J)  are  the 
parameters  of  the  system.  It  can  be  shown  by  variation  of  parameters  that  the  solution  to  (2.39)  may  be 
written  in  the  form 


x(t)  =  e 


°  x(to)  +  f  eF(t"T>Gu(T) 


dx 


(2.40) 


where  the  matrix  exponential  is  defined  by  the  series 


FT  2  T^  FkTk 

*  -  I  +  FT  +  ^  +  ...  +  ^ 


(2.41) 


To  obtain  a  discrete  model  for  the  system  described  by  (2.39)  with  the  controls  given  by  the  piecewise 
constant  output  of  a  zero  order  hold,  we  set  t^  -  kT,  t  ■  kT  +  T  and  find 


y  (k) 
x(k+l) 


Hx(k)  +  Ju(k) 
$x(k)  +  Tu(k) 


(2.42) 


where 


FT 


/ 

JQ 


Ft 


(2.43) 


dx  G 


The  reliable  and  accurate  computation  of  the  exponential  in  (2.43)  is  a  basic  requirement  of  any  computer 
design  package  for  digital  control  systems. 

To  illustrate  the  state  space  method,  we  consider  the  double  integrator  example  shown  in  Fig.  2.15. 


1/s 


1/s 


0  1 

0  0 
F 


[1  0]  x 


Figure  2.15:  An  Example  showing  the  relation  between  the  Block 
Diagram  and  the  State  Variable  Equations 


In  this  case 


and 


(2.45) 


One  of  the  advantages  of  the  state  variable  representation  is  that  many  of  the  features  of  interest 
in  analysis  and  design  can  be  computed  by  well  known  algorithms.  in  particular,  we  can  find  the  poles  and 
zeros  of  a  discrete  system  by  eigenvalue  analysis.  In  order  to  see  the  relations  between  the  state 
equations  (2.42)  and  the  transfer  function  given  by  (2.26),  we  need  to  take  the  z-transform  of  (2.42).  The 

result  is 


solving. 


zX(z)  -  $X(z)  +  TU(z) 


(zI-$)X(z)  -  TU(z) 

x(z)  -  (zi-$)_1ru(z) 


The  z-transform  of  the  output  Is  thus 


If  we  use  the  l/sz  example,  we  compute 


Y(z)  -  (H(zl-$)-1r  +  j)u(z) 


z  01  f  1  T 


z-1  -T 


0  z-1 


1  P-1  T 

(z-1)2  [  0  z-1. 


H(zI-$)  =  [1  0] 


0  z-lj  (z-1) 


and,  finally,  H(zI-4)  T  is 


[z-1  T] 


G(z)  >  [z-1  nr  t2 


2  1  1 


I_  (z+1)  (2.48) 

2  (z-1)2 

-  b(z)/a(z) 

Of  course  (2.48)  Is  the  same  as  (2.33). 

The  denominator  a(z)  of  (2.48),  which  defines  the  poles  of  the  transfer  function,  arises  from 
computing  the  inverse  of  zl  -  $  and  Is  given  by 

a(z)  =  det(zl  -  $)  (2.49) 

While  the  poles  of  the  transfer  function  are  given  by  the  roots  of  a(z)  =  0  from  (2.49),  we  can 
also  compute  the  poles  as  "natural  frequencies"  of  the  state  equations.  Consider  the  equation  with  no  input 

*k+l  *  \ 

k 

and  consider  the  states  to  be  all  moving  together  like  z  for  some  (perhaps  complex)  z.  We  have 

x(k)  -  XqZq 

If  this  form  is  substituted  into  the  equation,  we  find 

v  k+1  .  k 

X0Z0  "  *  X0Z0 


xozo  ■  *xo  (2-50 

Equation  (2.50)  is  an  eigenvalue  equation.  The  scalar  z  is  the  eigenvalue  and  the  vector  X.  is  an 
eigenvector.  If  we  write  (2.50)  as 

(z0I  -  *)X0  -  0 


we  can  see  from  elementary  matrix  algebra  that  these  equations  have  a  solution  only  if 


det(zQI  -  $) 


0 


so  Zg,  the  eigenvalue  of  $  ,  is  a  pole  of  the  transfer  function. 

^  The  zeros  of  the  system  can  be  computed  in  a  similar  way.  Suppose  the  states  and  the  input  move  as 
z*  but  the  output  remains  identically  zero.  Such  a  z^  is  a  zero  of  transmission  of  the  system.  The 
equations  are 

„  k  „  k 

x  -  X^  ,  u  =  U1z1 


rix1 


The  solution  for  z^  from  (2.51)  is  not  an  ordinary  eigenvalue  problem,  but  a  generalized  eigenvalue 
problem  since  we  can  write 


$  ri  X 


-I  0  1  X, 


As  before,  standard  numerical  methods  can  be  used  to  solve  for  the  vectors  X^  ,  and  the  eigenvalues, 
z,,  which  are  the  zeros. 

The  analysis  of  random  inputs  is  also  very  direct  when  equations  are  in  state  variable  form.  One 
source  of  signals  which  may  be  realistically  analysed  this  way  is  the  quantization  which  takes  place  with 
analog  to  digital  conversion  and  round-off  in  fixed  point  arithmetic.  A  sketch  of  the  quantization 
characteristic  is  shown  in  Fig.  2.16.  In  this  case,  Widrow  has  shown  that  the  quantization  error  resulting 


Figure  2.16:  The  input-output  characteristic  of  quantization  or  round-off 

from  these  operations  can  be  reasonably  modeled  as  random  white  noise  with  a  uniform  distribution  between 
-q/2  and  +q/2.  This  distribution  results  in  a  "noise"  which  has  mean  square  value  q^/12. 

To  compute  the  standard  deviation  (root  mean  square  error)  of  a  system  output  having  such  a  noise 
source,  we  proceed  as  follows.  Call  the  noise  e(k)  and  write  the  state  equation  for  the  system  shown 
in  Fig.  2.17  as 

x(k+l)  -  <t>  x(k)  +  Tx  e(k) 

We  assume  the  system  feedback,  if  any,  is  included  in  $  and  ignore  any  other  external  inputs  except  e (k) 
Next,  we  define  the  covariance  matrix  of  x(k)  as 

P(k)  -  E  x(k)xT(k)  (2.53) 


and  substitute  the  state  equation  to  find 
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Figure  2.17:  Diagram  of  an  ideal  system  and  its  practical 
realization  with  a  source  of  internal  noise 
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(2.54) 


Assuming  that  we  want  the  steady  state  solution,  Eq.  (2.54)  can  either  be  solved  sequentially  until  P 
no  longer  varies,  or  else  we  can  set  P(k+1)  =  P(k)  =  P  and  solve  the  resulting  algebraic  Lyapunov 
Equation.  Once  P  is  obtained,  the  mean  square  value  of  y  is  given  by 


E  y 


E  Hx(Hx) 


-  HPH 


(2.55) 


2.6  CONCLUSIONS 

We  have  thus  seen  that  digital  control  systems  can  be  analysed  by  z-transform  and  state  variable 
methods.  With  the  z-transform  representation,  we  can  perform  compensation  designs  using  the  familiar 
techniques  of  root  locus  and  frequency  response.  With  state  variable  representations  we  can  also  analyse 
the  behavior  of  digital  controls  with  computer  aids  such  as  eigenvalue  analysis  and  matrix  exponential 
computations.  The  design  of  digital  systems  using  state  variable  methods  will  be  described  in  Presentation 
7  of  the  lecture  series. 
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This  paper  deals  with  the  design  of  a  total  CAD  environment  in  which  control  systems  design  software  can 
be  embedded.  This  environment  includes  a  rich  set  of  software  tools  for  the  creation,  modification, 
simulation  and  analysis  of  dynamic  system  models  and  a  powerful  user  interface  to  these  tools  which 
incorporates  an  interactive  algorithmic  design  language.  An  analogy  can  be  made  between  such  a  CAD 
environment  and  those  which  exist  currently  for  software  development,  e.g.  Interlisp.  The  features  which 
such  an  environment  should  possess  and  how  these  might  be  developed  are  described  in  the  paper.  Particular 
attention  is  devoted  to  the  user  interface  since  it  is  now  generally  recognised  that  the  quality  of  the 
man-computer  interface  is  crucial  to  the  successful  use  of  a  CAD  system.  The  human  factors  aspects  of  the 
interface  are  reviewed  and  examples  given  of  how  a  CAD  system  can  be  designed  to  take  account  of  these 
aspects.  Reference  will  be  made  to  the  DELIGHT  system  from  the  University  of  California,  Berkeley  as  an 
example  of  a  user-orientated  design  environment. 

Introduction 


In  recent  years  there  has  been  a  proliferation  of  CAD  software  developed  for  use  in  control  system  design. 
Several  major  packages  have  been  developed,  starting  in  the  late  sixties  in  the  U.K. ,  and  many  of  these  are 
in  general  use  today,  having  been  developed  over  the  years  to  a  high  level  of  sophistication  in  respect  of 
the  design  methods  which  they  embody.  Most  of  these  packages  however  have  developed  less  coherently  in 
terms  of  such  aspects  as  efficiency,  ease  of  use,  flexibility,  reliability,  extendability,  etc.  This 
situation  is  perpetuated  by  a  large  proportion  of  the  more  recently  developed  CAD  software,  of  which  there 
is  a  growing  amount  as  a  result  of  the  now  easier  access  to  interactive  computing  facilities.  In  many 
other  fields  of  application  of  computing,  in  particular  in  the  design  of  those  information  systems  which 
are  to  be  accessed  by  both  computer  professionals  and  non-professionals,  i.e.  casual  users  whose  role  is 
not  to  develop  software  but  to  use  it  in  their  particular  application,  there  has  been  a  recent  upsurge  of 
interest  in  the  need  to  provide  both  a  total  integrated  computing  environment  in  which  the  system  can  be 
used  efficiently  and  effectively  and  the  proper  user  interface  to  reduce  the  occurrence  of  user  error, 
fatigue,  frustration,  etc.  There  are  two  major  aspects  to  this  in  respect  of  what  constitutes  a  good  system: 


i.  the  range,  effectiveness  and  integration  of  the  set  of  software  based  facilities,  or  "tools",  which  are 
available  to  the  user 

ii.  the  degree  to  which  "human  factors"  have  been  taken  into  account  in  the  design  of  the  man-computer  or  user 
interface  to  the  set  of  software  tools 


It  would  appear  that  we  are  now  at  a  turning  point  in  the  period  of  development  of  CAD  systems  for  control 
systems  design.  Although  there  are  still  serious  deficiences  in  some  areas  of  application,  a  large  number 
of  the  methods  for  design  now  exist  in  a  sufficiently  mature  and  tested  form  to  be  adopted  by  industry. 
However  industry  is  faced  with  a  wide  range  of  software,  developed  by  many  academic  or  industrial  groups 
with  varying  attitudes  to  those  human  factors  aspects  mentioned  above.  This  has  resulted  in  many  cases  in 
a  reluctance  by  industry  to  adopt  existing  CAD  systems  or  to  involve  themselves  in  the  costly  software 
redevelopment  necessary  to  turn  the  CAD  system  into  a  more  user  orientated,  integrated  form. 

It  is  therefore  clear  that  the  time  has  come  to  review  the  situation  and  consider  what  properties  or  features 
should  be  essential  components  of  the  new  generation  of  control  systems  CAD  software  which  would  overcome 
these  deficiences.  It  is  fortunate  that  this  review  can  be  carried  out  in  the  light  of  recent  developments 
in  integrated  programming  environments  and  of  recent  research  into  the  human  factors  aspects  of  software 
based  systems  in  general.  In  the  remainder  of  this  paper  we  consider  the  features  required  for  a  user 
orientated,  integrated  CAD  system  environment  and  the  means  by  which  they  can  be  achieved.  In  particular  we 
discuss  those  aspects  of  human  factors  which  relate  to  the  design  of  a  CAD  system  user  interface  and  provide 
examples  of  how  these  can  produce  a  system  which  is  both  more  efficient  and  less  prone  to  user  error.  Prior 
to  that  however  we  consider  the  major  components  of  an  integrated  control  system  design  environment  and  how 
they  can  be  integrated  via  a  common  user  interface. 


The  design  environment 


Recent  developments  in  the  design  and  production  of  the  integrated  programming  support  environment  (IPSE) 
have  provided  a  clear  direction  which  the  design  and  development  of  an  integrated  design  support  environment 
(IDSE)  might  take.  Amongst  the  best  examples  of  such  IPSE's  are  the  Interlisp  environment  (1),  (2)  and  the 
proposed  Ada  environment  or  APSE  (3).  Let  us  consider  firstly  what  are  the  major  components  of  an  IPSE  such 
as  Inter lisp. 


The  first  consideration  is  that  for  a  truely  interactive  programming  environment,  the  software  tools  available 
to  the  user  should  in  the  main  themselves  interactive.  In  particular  this  implies  the  need  for  an  interactive 
programming  language,  BASIC  being  a  widely  known  example  of  such  a  language.  The  important  distinction  here 
is  between  languages  which  require  a  lengthy,  tedious  and  error-prone  code  creation,  editing,  compilation, 
and  linking  cycle  in  order  to  reach  a  situation  where  useful  results  are  being  produced  and  languages  where 
more  immediate  results  can  be  obtained,  for  example  subsequent  to  entering  a  single  language  statement.  This 
consideration  implies  that  the  programing  language  for  such  an  environment  must  possess  a  number  of  features, 
including  : 


i.  syntax  checking  of  language  statements  on  entry  -  eliminates  the  frustration  of  syntax  errors  being 
detected  late  in  the  software  development  cycle,  i.e.  at  the  compilation  or  linking  stage 

ii.  ability  to  incrementally  compile  and,  if  required,  execute  sections  of  the  program  as  they  are  created. 

It  is  obvious  that  whilst  not  all  semantic  or  syntactic  errors  can  be  detected  in  this  way,  since  many 
of  these  will  not  be  apparent  until  all  the  program  is  present,  a  large  proportion  of  errors  can  be  thus 
detected,  resulting  in  less  time  being  spent  in  debugging  the  software  at  later  stages  of  the  production 
cycle 

iii.  support  for  a  modular  program  structure  to  permit  the  development  of  large  pieces  of  software  in  a 
systematic  manner,  using  a  methodology  such  as  "top-down"  programming.  This  is  related  to  the  previous 
feature  and  implies  the  need  for  program  modules  to  be  compiled  and,  if  possible,  tested  in  an  independent 
manner.  Moreover,  the  system  should  provide  consistency  checking  between  modules  at  the  time  of  their 
creation,  together  with  the  necessary  tools  for  administration  and  documentation  of  the  modules,  including 
different  generations  fo  the  same  module,  e.g.  after  editing 

iv.  accurate  and  informative  error  reporting  at  every  stage  of  the  program  production  process. 

This  leads  us  to  consider  the  first  component  of  the  design  support  environment  for  control  systems  CAD. 

The  design  language 

The  primary  purpose  of  the  design  language,  analagous  to  that  of  the  programming  language  in  an  IPSE,  is  to 
enable  the  creation  of  an  executable  description  of  the  dynamic  system  under  consideration.  The  process  of 
design,  as  of  programming,  is  to  systematically  correct,  enhance,  extend  and  optimize  this  system  description 
until  it  performs,  under  execution,  in  accordance  with  the  required  performance  specification.  (This  implies 
the  need  to  be  able  to  formally  or  informally  specify  the  required  performance  of  the  system  -  see  later  in  the 
paper  the  discussion  on  formal  specification  tools). 

There  are  many  examples  of  design  languages  which  meet  this  primary  requirement,  since  in  this  respect  the 
design  language  is  equivalent  to  a  system  modelling  language.  Such  examples  include  the  SIMNON  language 
(4)  and  DYMOLA  (5),  the  former  having  interactive  features  such  as  syntax  checking  on  entry,  and  the  latter 
illustrating  the  use  of  a  modular  structure  for  modelling  large  system  by  decomposing  them  into  a  set  of 
smaller  subsystems. 

Most  examples  of  design  languages  are  entirely  textually  based,  i.e.  the  descriptions  are  created  by  means 
of  text  statements,  entered  via  a  terminal  keyboard.  SIMNON  and  DYMOLA  are  typical  of  this  mode  of  use. 

One  could  argue  however  that  this  is  not  the  most  natural  method  for  entry  of  the  structural  or  topological 
aspects  of  the  system  description  and  is  prone  to  error,  especially  in  defining  the  detailed  interfaces 
between  the  various  segments  of  the  description.  A  more  natural  method  for  the  system  designer  to  enter 
the  structural  or  topological  part  of  the  system  description  is  a  graphical  one  and  this  method  has  been 
well  utilised  in  other  areas  of  CAD  such  as  electronic  circuit  design,  mechanical  component  design  etc. 
Techniques  for  graphical  system  description  have  recently  been  incorporated  into  control  system  CAD  packages, 
providing  either  a  block  diagram  or  signal  flow  graph  form  of  description.  However,  few,  if  any,  of  these 
support  a  specific  modelling  methodology,  such  as  top-down  decomposition,  nor  do  they  provide  a  full  range  of 
interactive  features  such  as  "syntax"  checking  on  entry,  incremental  compilation  and  execution,  and  automatic 
consistency  checking  between  segments  of  the  system  description. 

It  is  clear  however  that  such  a  graphical  design  language  takes  the  form  of  a  very  high-level  languat  for 
structural  description  and  is  in  fact  only  a  pre-processor  into  the  main  design  language  in  which 
norv-structural  aspects  of  the  system  are  described  and  which  is  designed  for  direct  execution  by  an  interpreter 
or  incremental  compilation.  The  design  language  must  therefore  encompass  this  graphical  pre-processor, 
allowing  the  user  a  choice  of  either  using  the  graphical  or  the  textual  mode  for  entry  of  the  system 
description.  Clearly  this  demands  a  close  integration  between  the  design  language  and  its  graphical 
pre-processor.  The  major  components  of  the  design  environment  with  respect  to  the  design  language  requirement 
are  therefore 

i.  a  pre-processor  which  translates  the  high-level  graphical  system  description  into  design  language 
statements 

ii.  a  "syntax"  checker  which  provides  error  reporting  at  both  the  graphical  pre-processor  and  the  design 
language  levels  of  entry  of  system  descriptions 

iii.  a  consistency  checker  which  provides  error  reporting  on  inconsistencies  in  the  interfaces  between 
different  segments  of  the  system  description  (graphical  pre-processor  and  design  language  levels  of 
entry) 

iv.  an  interpreter  or  incremental  compiler  which  will  execute  (compiled)  segments  of  the  system  description 
in  the  design  language, 

v.  a  compiler  from  the  design  language  into  the  "machine  language"  of  the  system  for  fast  execution  of  the 
system  description. 

A  useful  but  not  essential  addition  to  this  list  is  a  "reverse  processor"  which  will  translate  system 
descriptions  in  the  design  language  into  the  equivalent  graphical  high  level  description. 

The  editors 


The  second  major  set  of  components  of  the  design  environment  are  the  system  description  editors.  These 
components  allow  the  user  to  create  and  modify  system  descriptions  in  the  design  language  and  in  its  high 
level  graphical  equivalent  and  are  necessarily  interactive  tools.  In  describing  above  the  need  for  syntax 
checking  on  entry,  we  have  already  defined  one  of  the  most  important  functions  of  the  editors,  which 
therefore  must  be  integrated  components  of  the  design  environment.  It  is  not  sufficient  to  rely  on  use  of 
the  general  purpose  editor  available  on  the  local  computer  system  for  entry  of  system  descriptions,  since 
this  is,  by  definition,  a  non-integrated  component  of  the  design  environment  and  hence  since  it  has  no 
knowledge  of  the  design  language  cannot  provide,  for  example,  the  syntax  checking  requirement  described  above. 


The  form  that  the  graphical  high  level  language  editor  takes  will  depend  on  the  sophistication  of  the 
available  terminal  equipment.  phst,  high  resolution  interactive  graphics  workstations,  such  as 
incorporated  in  the  Three  Rivers/lCL  Perq  and  the  Apollo  machines  allow  full  exploitation  of  interactive 
techniques  using  devices  such  as  tablets  and  "mouses"  and  graphical  enhancement  methods  using  colour, 
variable  intensity,  animation,  etc.  Current  examples  mostly  incorporate  the  use  of  a  screen-based  menu  for 
selecting  a  graphical  symbol  by  positioning  of  a  cursor  over  the  appropriate  symbol.  Cursor  positioning 
can  be  either  by  light  pen  or  "mouse"/tablet  combination.  The  more  sophisticated  systems  make  use  of 
multiple  windows  on  the  graphical  display  for  menus,  text  from  keyboard  interaction  and  display  of  various 
segments  of  the  graphical  system  description.  A  "window  manager"  allows  the  user  to  create  and  manipulate 
the  windows  on  the  screen,  introducing  priority  levels  for  overlapping  windows,  higher  priority  windows 
obscuring  those  with  lower  priority,  just  as  a  stack  of  overlapping  drawing  sheets.  This  permits,  for 
example,  the  user  to  enter  a  description  of  a  segment  of  the  system  in  the  design  language  in  one 
window  whilst  viewing  the  structure  of  the  overall  system  description  in  another,  retaining  further 
windows  for  menu,  error  messages,  system  prompts,  etc. 

For  the  design  language  itself,  a  screen  based  editor  is  preferred,  ideally  using  the  same  techniques  and 
tools  as  the  graphical  language  editor,  e.g.  tablet  for  positioning  the  cursor,  windowing  for  menus, 
simultaneous  display  of  multiple  segments  of  the  system  description  to  aid  editing,  visual  enhancement 
techniques  for  reporting  the  position  of  syntax  errors,  etc. 

We  will  consider  further  the  interactive  features  of  the  editors  when  we  discuss  the  human  factors  aspects 
of  the  design  environment  later  in  this  paper. 

The  interactive  support  tools 

The  third  major  set  of  components  of  the  design  environment  comprises  a  range  of  tools  with  the  sole 
function  of  supporting  the  interactive  use  of  the  design  environment.  The  principal  requirements  of  a 
user  of  any  interactive  computer  system  are 

i.  to  obtain  knowledge  of  the  state  of  the  system  at  any  moment  in  its  use,  e.g.  what  data  entities  are 
available,  what  software  tools  are  available,  what  software  tool  is  in  current  use,  what  mode  of  the 
editor  is  in  current  use,  etc,, 

ii.  to  obtain  a  "history"  of  the  interactive  session  so  far,  in  order  to  determine  what  sequence  of  actions 
has  led  to  the  current  state, 

iii. to  obtain  hard-copy  documentation  of  the  current  state  of  various  data  entities  in  the  system,  e.g.  the 
current  system  description, 

iv.  to  administer  the  various  data  entities  in  the  system,  e.g.  delete  unwanted  descriptions,  provide  back-up 
copies  after  modification  of  current  descriptions, 

v.  to  "undo"  or  reverse  the  effects  of  a  previous  sequence  of  actions,  used  in  conjunction  with  the  "history" 
facility. 

All  the  above  tools  are  necessarily  integrated  components  of  the  design  environment,  some  more  obviously,  e.g. 
the  "history"  and  "undo"  facilities,  some  less  so,  e.g.  the  documentation  tools.  However,  in  the  latter  case, 
whilst  general  purpose  tools,  e.g.  for  producing  listings  of  text  files  or  graphical  hard-copies  can  be  used, 
user  support  can  be  considerably  enhanced  by  the  use  of  special  purpose  integrated  tools,  which,  for  example, 
automatically  incorporate  standard  system  notation  or  specific  project  identification  on  the  documentation 
output,  updated  according  to  the  current  state  of  the  design  procedure. 

The  debugger 

The  fourth  major  component  is  the  interactive  system  description  debugger.  Since  it  is  highly  probable  that 
the  system  description  will  contain  errors,  it  is  necessary  to  provide  a  means  of  tracing  the  errors.  These 
are  essentially  "run-time"  errors,  i.e.  occuring  at  the  time  of  system  execution,  and  will  become  apparent  to 
the  user  either  as  a  result  of  the  run-time  system  or  "executor"  flagging  an  error,  e.g.  a  numerical  underflow 
or  overflow,  or,  more  likely,  by  the  user  detecting  an  unusual  occurrence  in  the  output  data.  In  the  case  of 
a  complex,  large  system  description  (as  in  the  case  of  a  large  program)  it  is  not  an  easy  task  to  detect  such 
errors  and  depends  upon  the  user  both  being  able  to  access  and  monitor  the  execution  at  various  points  in  the 
system  description  and  having  at  least  an  informal  criterion  against  which  to  judge  the  performance  of  the 
system  at  the  point  or  points  monitored.  In  order  to  access  and  monitor  specific  points  in  the  system 
description,  e.g.  the  output  of  various  transducers,  control  inputs  to  subsystems,  etc.  the  graphical  level 
system  description  provides  a  very  useful  tool.  This  allows  the  user  to  identify  points  by  means  of  the 
light  pen  or  cursor  and  then  further  refine  his  selection  by  referencing  specific  variables  via  the  keyboard 
or  by  menu  selection. 

The  technique  used  here  bears  a  strong  resemblance  to  the  methods  which  are  in  current  use  for  accessing 
and  monitoring  signals  in  an  industrial  computer-based  process  control  system.  In  this  situation  the 
process  operator  can  select  subsystems  oi  his  overall  process  by  name  or  graphically  from  the  display  of 
the  overall  process  flow  chart.  Further  refinements  can  be  made  in  this  way  to  lower  level  subsystems 
until  the  one  in  which  the  error  is  occuring  is  detected.  A  graphical  display  can  then  be  obtained  of 
suspect  signal  vs.  time  plots,  the  operator  then  employing  his  knowledge  of  plant  operation  to  detect  errors. 

To  assist  in  the  detection  of  errors,  formal  verification  criteria  can  be  used.  These  are  defined,  in  the 
case  of  the  process  operator,  at  coranissioning  of  the  plant,  and  represent  bounds  on  signal  levels,  rates  of 
change  of  signals,  etc.  Use  of  colour  graphics  allows  the  operator  to  instantly  detect  violation  of  bounds 
from  the  signal  vs.  time  plot,  on  which  the  bounds  are  displayed.  Indeed,  the  violation  of  such  bounds  is 
often  detected  automatically  and  the  situation  brought  to  the  attention  of  the  process  operator  by  means  of 
alarms. 

Whilst  the  situation  in  the  CAD  environment  is  not  so  critical  from  a  safety  or  security  viewpoint  as  it  is 
in  the  process  control  case,  many  of  the  same  techniques  can  be  employed.  The  use  of  formal  specifications 
for  the  system  description  will  permit  the  automatic  verification  of  "correct"  operation  of  certain  components 
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of  the  system  description  at  execution  time,  where  a  formal  verification  criterion  lias  been  attached  to 
that  components.  We  write  "correct"  in  quotes,  since  it  is  clear  that  the  description  cannot  necessarily 
be  verified  entirely  in  this  manner,  but  only  the  fact  that  signals,  or  their  rates  of  change,  etc.,  lie 
within  certain  limits. 

Much  can  be  learnt  for  CAD  from  the  process  control  industry  concerning  the  use  of  colour  graphics  and 
modes  of  interaction.  A  considerable  study  has  been  made  in  that  industry  into  human  factors  aspects  of 
the  techniques  of  interaction  between  the  process  and  its  operator,  since  in  this  field  errors  due  to 
fatigue,  unclear  displays,  etc.  are  critical  if  not  disasterous.  Many  of  these  techniques,  though  not 
of  such  a  critical  nature,  carry  over  into  the  CAD  environment.  In  particular  the  use  of  colour  to  flag 
errors  on  the  graphical  display  of  the  system  description,  the  use  of  colour  on  signal  vs.  time  plots  to 
indicate  signals  going  out  of  bounds,  the  ability  to  expand  selected  subsystems  on  the  graphical  display  to 
examine  in  more  detail  specific  signal  monitoring  points,  are  examples  of  such  techniques. 

An  obvious  requirement  is  for  the  system  description  debugger  to  be  interactive  during  execution  of  the 
description.  This  implies  a  need  for  the  user  to  be  able  to  interrupt  the  system  execution  either 

i.  at  any  point  of  time  defined  by  the  user,  e.g.  at  n  seconds  after  the  start  of  the  execution,  or 

ii.  on  any  signal  or  set  of  signals  attaining  a  specified  condition,  e.g.  violating  an  upper  or  lower 
bound,  or 

iii.  instantaneously,  from  the  keyboard,  e.g.  oi.  visual  detection  of  a  specific  system  condition. 

On  interruption,  the  user  should  be  capable  of  a  wide  range  of  actions,  including  : 

i.  examination  of  all  signal  levels,  parameter  values,  etc.  in  the  system  description  and  their  time 
histories, 

ii.  the  resetting  of  signal  levels,  parameter  levels,  error  bounds,  etc., 

iii.  restarting  the  system  execution,  either  from  the  point  of  interruption  or  another  suitable  start  point, 

iv.  storing  intermediate  results, 

v.  aborting  the  execution. 

The  need  for  an  interactive  debugger  again  stresses  the  importance  of  an  integrated  set  of  tools,  since  it  is 
clear  that  the  debugger  must  be  closely  integrated  with  the  system  execution  component,  the  system  description 
editors  and  the  full  range  of  interactive  support  tools. 

The  optimisation  language 

Closely  related  to  the  design  language,  on  which  we  will  expand  later,  is  the  optimisation  language.  Unlike 
the  program  design  and  development  process,  where  new  code  is  added  or  existing  code  modified  in  a  manual 
way  in  order  to  get  the  program  to  behave  in  a  required  manner,  the  system  design  process,  whilst  also 
possible  in  the  above  manner,  can  also  be  performed,  entirely  or  in  part,  by  an  automatic  process  using  an 
algorithmic  approach.  This  algorithmic  approach  is  generally  based  on  some  optimisation  procedure,  whereby 
"free"  parameters  in  the  system  description,  i.e.  variables  whose  values  can  be  adjusted  externally  to  the 
description,  are  given  a  sequence  of  values  according  to  a  pre-defined  algorithm  until  performance  of  the 
system  description  on  execution  satisfies  some  criteria  or  falls  within  specified  bounds.  Such  algorithms 
can  be  single  step,  as  in  pole  allocation  algorithm  for  state  feedback,  or  multiple  step,  as  in  hill-climbing 
parameter  optimisation  algorithms. 

This  major  component  of  the  design  environment  requires  close  integration  with  the  system  design  language, 
since  it  must  access  the  system  description  during  execution,  i.e.  be  able  to  interrupt  execution,  monitor 
values  and  reset  parameters.  Close  integration  is  also  necessary  at  the  user  interface  level.  Many  of  the 
same  language  contractions  are  required  by  both  the  design  language  for  defining  system  functions  and  by 
the  optimisation  language  for  defining  functions  of  the  optimisation  algorithm.  In  practice  the  design 
language  and  the  optimisation  language  become  indistinguishable  and  henceforth  we  will  refer  to  the  design/ 
optimisation  language  as  a  single  entity,  called,  for  notational  simplicity,  just  the  design  language  (by 
analogy,  an  integrated  programming  support  environment  =  a  programming  language  +  tools,  an  integrated  design 
support  environment  =  a  design  language  +  tools). 

The  shell 


This  is  the  final  major  component  of  the  design  environment  and  is  the  means  by  which  each  of  the  tools  in 
the  environment  are  accessed  or  invoked  by  the  user.  From  the  user's  viewpoint  it  is  basically  a  command 
language  processor,  where  each  command  invokes  a  specific  tool,  e.g.  the  editors,  executor,  debugger,  etc. 
From  the  viewpoint  of  an  integrated  environment  however,  it  is  unnecessary  to  distinguish  between  the  shell 
or  command  language  and  the  design  language.  Since  the  latter  is  capable  of  incremental  compilation  and 
execution,  a  command  can  take  the  form  of  a  procedure  in  this  language,  perhaps  with  a  reserved  name.  Simply 
stating  the  name  of  the  procedure,  together  with  any  arguements,  will  invoke  its  execution  and  hence  the 
execution  of  the  appropriate  design  tool. 

This  latter  aspect  of  the  design  environment  has  a  very  important  implication.  This  is  that  since  the  design 
tools  are  simply  procedures  in  the  design  language,  the  design  tools  must  themselves  be  written  in  the  design 
language  itself  or  some  "lower  level"  language  into  which  the  design  language  is  translated  or  in  which  the 
design  language  can  call  procedures.  This  implication  for  the  interna]  form  of  the  design  environment  will 
become  clearer  when  we  consider  the  example  of  the  DELIGHT  environment. 
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Since,  from  the  user's  viewpoint,  the  shell  is  the  major  means  by  which  interaction  is  carried  out  with  the 
environment,  it  is  important  to  make  this  as  adaptable  as  possible,  both  in  terms  of  replacing  existing 
command  procedures  by  new  user-defined  names  and  of  introducing  new  command  procedures.  This  can  be  done  in 
a  number  of  ways.  For  example,  the  user  can  own  a  personal  "profile"  file  (as  in  the  Three  Rivers/ICL  Perq 
PQOS  system)  which  contains  a  list  of  personalised  command  names  and  a  corresponding  list  of  standard  or 


user-defined  procedure  names.  In  this  way,  one  user  command  name  can  be  used  to  invoke  one  or  a  sequence  of 
standard  procedure  names.  To  add  to  or  otherwise  modify  this  profile  file  the  user  can  employ  the  system 
editor,  or,  more  conveniently  can  use  a  command  procedure,  e.g.  "define"  or  "macro"  in  DEUGHT,  to  specify  a 
unique  correspondence  between  invoking  a  personalised  user  procedure  call  and  invoking  a  sequence  of  one  or 
more  standard  (built  in)  procedure  calls.  Once  the  "define"  or  "macro"  equivalence  lias  been  specified,  it  is 
added  to  that  user's  personal  profile  file  and  maintained  either  for  all  subsequent  use  or  just  during  the 
remainder  of  the  current  user  session. 

The  DEUGHT  environment 

It  would  be  useful  at  this  point,  in  order  to  illustrate  those  aspects  of  the  design  environment  described 
above,  to  consider  the  DELIGHT  (Design  Laboratory  with  Interaction  and  Graphics  for  a  Happier  Tommorrow) 

(6),  as  an  example  of  a  currently  operational  design  environment  which  incorporates  many  of  these  aspects. 
DELIGHT  was  developed  (and  is  still  under  development)  by  Optimization  Based  Computer  Aided  Design  Group  of 
Professor  E.  Polak,  at  the  University  of  California,  Berkeley. 

The  DELIGHT  system  is  modelled  to  a  large  extent  on  the  concept  of  software  tools  as  described  in  (7  ). 

The  system  has  been  programmed  in  Ratfor  (Rational  Fortran)  (7  ),  a  Fortran  pie-processor  which  employs 
structured  programming  concepts  and  other  features  to  improve  software  readability,  ease  of  programming,  etc. 

The  rationale  behind  the  design  of  the  DELIGHT  system  was  the  need  for  a  flexible  system  which  would 
accomodate  a  variety  of  design  situations,  in  particular,  different  system  modelling  and  simulation 
environments,  which  would  be  easy  to  port  to  any  Fortran  environment,  which  would  accomodate  both  experienced 
and  inexperienced  users  and  which  would  be  efficient  in  terms  of  machine  use  yet  allow  the  user  the  maximum 
possible  flexibility  in  expanding  and  modifying  the  available  set  of  software  tools  in  the  system  and  its 
command  language  structure.  Finally,  since  the  system  was  primarily  intended  to  support  an  optimisation-based 
design  methodology,  the  design  language  is  formulated  in  such  a  way  that  optimisation  algorithms  can  be 
directly  implemented  in  the  language. 

The  design  language  in  DELIGHT  is  called  RATTLE  and  is  an  interactive  form  of  the  language  Ratfor  .  As  such  it 
is  sufficiently  similar  to  a  range  of  popular  structured  programming  languages  to  make  it  easy  to  learn,  as 
well  as  ensuring  good  programming  practice.  The  interactive  feature  is  obtained  by  a  rapid  compilation  in  an 
intermediate  form,  using  only  a  single  pass  over  the  source  code.  This  permits  incremental  compilation,  i.e. 
the  direct  compilation  and  execution  of  the  intermediate  code  at  any  point  in  the  code  creation  process, 
without  the  need  for  a  lengthy  linking  and  loading  phase.  Thus  the  language  satisfies  our  requirement  for 
interaction  via  incremental  compilation. 

Since  the  DEUGHT  system  is  not  primarily  intended  for  CAD  of  control  systems,  the  design  language  has  no 
specific  constructs,  data  types,  etc.  which  relate  specifically  to  control  systems  (these  are  currently 
in  the  process  of  being  introduced  into  DEUGHT  in  a  joing  project  between  Berkeley,  Lawrence  Livermore 
Laboratory,  Imperial  College,  London  and  Kingston  Polytechnic).  Nevertheless,  system  descriptions  can  be 
created  directly  in  the  RATTLE  language,  in  the  same  way  as  one  would  write  a  FORTRAN  or  Pascal  based  svstem 
simulation  program.  A  high  level  graphical  design  language  also  does  not  currently  exist  (under  development 
at  the  Lawrence  Livermore  Labs).  However  many  of  the  other  important  features  discussed  earlier  in  the  paper 
as  of  prime  importance  in  a  design  language  exist,  mainly  in  terms  of  its  closely  integrated  relationship  with 
the  other  components  of  the  DEUGHT  environment. 

In  particular,  the  environment  provides  automatic  syntax  checking  of  the  language  statements  on  entry;  for 
example,  if  the  user  enters  s 

if  size  ^  0 

the  system  responds  immediately  with  s 
if  size  >  0 

ERROR  (1)  assignment  syntax  error  (size  >  0) 

indicating  that  the  "if"  statement  is  in  error  -  "if"  must  be  followed  by  a  logical  expression,  which  implies, 
since  spaces  are  used  in  RATTLE  to  indicate  the  end  of  an  expression,  that  the  correct  statement  is  : 

if  size>  0 
or 

if  (size  ^  0) 

In  the  second  case,  parentheses  override  the  end  of  expression  indicated  by  the  blank  after  "size". 

The  editor  in  DEUGHT  can  also  be  used  to  enter  RATTLE  programs  in  wh  ich  case  entry  is  related  to  a  file 
on  the  system  which  is  created  by  the  editor  and  retains  the  code  after  the  end  of  the  terminal  session.  In 
this  case,  however,  the  editor  provides  no  syntax  checking.  This  is  a  current  deficiency  of  DEUGHT  since, 
although  the  editor  may  be  used  to  enter  other  text  files  than  RATTLE  programs,  the  editor  should  give  the 
option  to  the  user  to  include  syntax  checking  on  entry.  Currently  this  can  be  got  around  in  DEUGHT  by  the 
use  of  the  "include"  command.  If  a  file  "fred"  is  created  by  the  editor,  the  command  : 

include  fred 

causes  DELIGHT  to  read  lines  from  the  file  just  as  if  they  had  been  typed  at  the  terminal,  in  which  case 
syntax  checking  is  carried  out  at  this  time.  In  addition,  use  of  the  "echo"  command  causes  DEUGHT  to  type 
each  line  on  the  terminal  as  it  is  being  entered,  so  that  the  user  is  able  to  see  precisely  where  the  errors 
have  occur ed. 

The  DEUGHT  environment  also  cc  tains  a  number  of  interactive  support  tools.  These  include  the  conmands  : 
"display"  -  allows  the  user  to  view  all  the  DEUGHT  symbol  table  entries  which  are  of  a  specified  class  and 
which  match  an  optional  specified  pattern 
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e.g.  display  arrays  a* 

would  list  all  the  user's  arrays  beginning  with  the  letter  "a". 

"display-time"  displays  a  list  of  RATTLE  procedure  names,  the  associated  cpu  times  for  execution  and 
number  of  times  called  so  far. 

"time"  -  displays  total  cpu  time  since  starting  the  DELIGHT  user  session. 

"date"  -  displays  the  current  date  and  time 

"history"  -  displays  the  last  22  lines  of  input  entered  from  the  terminal 

"whatis"  -  together  with  an  item  name  as  an  argument,  displays  the  nature  of  the  item,  e.g.  variable, 
procedure,  etc. 

"list"  -  displays  a  listing  of  a  file  created  in  DELIGHT 

"output-to"  -  copies  all  output  to  a  file,  but  not  to  the  display 

"store"  -  writes  the  binary  values  of  all  DELIGHT  and  user  variables  and  arrays  into  a  optionally 

specified  file  (default  file  is  "memfile")  so  that  everything  may  be  "restored"  at  a  later 
date 

"restore"  -  see  above 

"whoami"  -  displays  the  current  user  name 

The  real  use  of  the  "history"  command  is  not  just  to  view  the  last  22  lines  of  input,  but  to  permit  the 
user  to  re-execute  a  previous  command  or  other  input  line.  This  can  be  done  by  typing  "|"  followed  by 
the  initial  characters  of  the  line  to  be  reissued  or  its  line  number. 

The  DELIGHT  environment  also  includes  powerful  debugging  facilities.  The  major  commands  available  for 
debugging  are  : 

"trace"  -  used  for  debugging  DELIGHT  run-time  errors  after  the  program  has  been  suspend ea  for  some 

reason.  It  prints  a  list  of  called  Rattle  procedures  prior  to  the  program  suspension. 

"enter"  -  used  in  conjunction  with  DELIGHT  commands  such  as  "display"  to  modify,  list,  etc.  local 

variables  in  a  RATTLE  procedure  which  has  been  "entered". 

The  DELIGHT  environment  allows  these  debugging  tools  to  be  used  interactively  by  pi»v.:!„:ig  the  use,  srith 
the  ability  to  interrupt  the  current  program  execution.  Two  kinds  of  interrupts  a.>  available  i  "hard  and 
"soft"  interrupts.  A  "hard"  interrupt  is  generated  when  the  interrupt  key  on  the  terminal  is  pressed  twice 
in  succession.  This  suspends  program  execution  at  the  current  state,  but  allows  the  user  to  enter  new  input 
line  at  a  "second  level"  of  execution,  including  the  debug  comnands  "trace"  and  "enter"  described  above. 

A  further  interrupt  can  be  issued  at  this  level  if  required  in  which  case,  a  "third  level"  of  execution  is 
entered,  and  so  on.  A  "soft"  interrupt  is  generated  when  the  interrupt  key  on  the  terminal  is  pressed  once. 

An  executing  program  can  detect  such  an  interrupt  if  the  word  "interrupt"  has  been  included  in  an  "if" 
statement  within  the  program.  This  statement  can  then  cause  either  the  execution  flow  in  the  program  to 
alter  in  the  normal  way,  e.g. 
if  interrupt  print  x  else  print  y 

or  the  execution  to  the  suspended  by  means  of  the  "suspend"  statement  in  RATTLE,  e.g. 
if  interrupt  suspend 

The  "suspend"  statement  is  a  powerful  feature  of  RATTLE  since  it  allows  many  of  the  facilities  described 
above,  under  the  debugger  heading,  as  desirable  in  an  interactive  design  environment,  e.g.  suspension  of 
execution  on  a  variable  or  set  of  variables  attaining  a  specified  condition.  On  suspension,  the  user  can 
examine  variables  values,  etc.  in  a  lower  execution  level  in  DELIGHT,  e.g.  using  the  "enter"  command,  and 
then  resume  program  execution  from  the  point  at  which  it  was  suspended,  using  the  "resume"  command. 

For  optimisation,  the  DELIGHT  environment  provides  a  particularly  powerful  set  of  procedures  built  into  a 
library  on  the  system.  These  procedures  are  however  simply  written  in  the  RATTLE  language  or  in  RATFOR 
or  FORTRAN,  procedures  in  both  of  the  latter  being  callable  from  RATTLE.  In  fact,  since  DELIGHT  was  primarily 
intended  to  be  an  optimisation  tool  which  could  be  interfaced  to  any  application  dependent  modelling  and 
simulation  environment,  the  design  language  of  DELIGHT,  namely  RATTLE,  is  as  we  have  already  noted,  not 
biased  towards  a  particular  application  environment  such  as  control  system  design,  but  is  if  anything  biased 
towards  the  optimisation  environment.  Hence,  in  the  case  of  DELIGHT,  the  design  language  and  the  optimisation 
language,  as  defined  earlier  in  this  paper,  are  one  and  the  same. 

Two  possible  courses  of  action  therefore  exist  for  extending  DELIGHT  into  an  application  specific  design 
environment.  Firstly  the  design  language  RATTLE  could  be  extended  to  include,  foi  example,  application 
specific  data  types,  e.g.  systems  and  subsystems  for  control  system  design,  and  operations  directly  on  these 
data  types.  This  could  be  carried  out,  to  a  limited  extent  and  not  very  efficiently,  with  the  DELIGHT 
environment  by  using  the  language  extension  facilities  such  as  "defines"  and  "macios",  oi  major  internal 
extensions  could  be  made  to  the  language  parser  and  compiler  to  include  acceptance  and  translation  of  the 
extended  features. 

An  alternative,  which  is  the  current  course  adopted  by  DELIGHT,  is  to  provide  internal  me<  nai  isms  within 
the  environment  which  allow  it  to  be  interfaced  to  another,  application  specific,  modelling  a. id  simulation 
environment.  The  problem  with  this  course  is  that  the  simulation  environment  cannot  be  totally  integrated 
with  the  DELIGHT  environment  in  that  it  may  not  use  the  same  techniques  of  user  interaction  and  would  not 
allow  the  user  to  mix  RATTLE  language  statements  into  the  system  description  which  is  built  in  the  design 
language  of  the  simulation  environment. 
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At  present,  a  project  is  in  operation  (jointly  between  Berkeley,  Lawrence  Livermore  Labs,  Imperial  College, 
London  and  Kingston  Polytechnic)  to  extend  the  DELIGHT  environment  into  a  control  system  design  environment 
using  the  first  course  of  action,  i.e.  using  the  existing  language  enhancement  features  of  DELIGHT  to 
permit  the  user  to  create,  modify  and  execute  system  descriptions  in  the  RATTLE  language  in  as  flexible 
and  easy-to-use  manner  possible,  within  the  constraints  this  course  of  action  imposes.  This  will  require 
the  inclusion  of  RATTLE  commands  which  allow  the  user  to  define  system  descriptions  as  interconnections  of 
subsystems,  execute  these  descriptions  to  produce  time  and  frequency  responses  and  incorporate  optimisation 
procedures  and  system  analysis  procedures  into  the  design  process. 

It  is  a  powerful  feature  in  DELIGHT  which  allows  this  course  of  action  to  be  taken,  that  of  the  ability  for 
the  user  to  extend  and  modify  the  existing  set  of  user  commands  by  means  of  the  "define"  and  "macro"  commands. 
Firstly  we  should  note  that  the  "shell"  in  DELIGHT  is  actually  the  RATTLE  language  itself,  since  commands  are 
implemented  in  DELIGHT  simply  as  procedure  calls  in  RATTLE,  or  in  the  case  of  more  complex  commands  as  a 
combination  of  other  RATTLE  statements  and  procedure  calls.  As  examples,  ct nsider  first  the  DEL1CHT  command 
"include".  This  is  implemented  as  a  single  call  to  the  procedui e  "include"  and  the  correspondence  between 
the  command  and  the  call  is  made  using  the  "define"  statement  in  RATTLE: 

define  (include,  include-  ) 

The  issue  of  this  statement  (carried  out  automatically  at  the  beginning  of  a  DELIGHT  session  as  part  of  a 
default  user  "profile"  for  the  session),  effectively  replaces  all  references  to  the  command  "include"  by  a 
call  to  the  procedure  "include-". 

Consider  also  the  command  "whoami"  This  is  translated  by  the  DELIGHT  system  into  a  set  of  RATTLE  statements 
by  means  of  the  "macro"  statement  in  the  RATTIE  language  : 

macro  whoami  ( 
import  user-name- 

print  f  'You  are  %p.  You  are  running  DELIGHT. /n*  user-name- 

) 

The  "macro"  statement  simply  replaces  the  reference  to  "whoami"  by  the  block  of  RATTLE  statements  within  the 
pair  of  brackets.  Note  that  the  length  of  the  macro  is  unlimited,  so  complex  operations  can  be  carried  out. 
Also,  both  macros  and  defines  can  accept  arguments  to  the  command  as  well  as  simple  commands,  as  in  the 
command  to  list  a  file,  for  example  : 

list  fred 

In  summary,  therefore,  DELIGHT  offers  a  good  example  of  an  integrated  design  environment,  which  whilst  not 
being  application  specific  to  the  area  of  control  systems  CAD,  offers  many  of  the  major  interactive  features 
and  components  to  be  expected  in  such  an  environment  and  the  expandability  to  make  it  application  specific, 
albeit  in  a  limited  and  not  highly  efficient  way.  A  control  system  CAD  environment  could  well  employ  many  of 
the  features  of  DELIGHT  but  would  have  in  addition  a  design  language  which  might  take  the  form  of  an  extended 
version  of  RATTLE,  with  implicit  data  types  and  language  constructs  which  mote  closely  relate  to  the 
application  and  hence  make  an  easier-to-use  user  interface  for  the  creation  and  modification  of  system 
descriptions. 

In  addition,  the  next  generation  of  CAD  environments  must  take  into  full  consideration  those  human  factors 
aspects  which  are  essential  to  the  design  of  a  good  user  interface.  As  the  final  section  of  this  paper,  we 
will  now  briefly  review  some  of  these  aspects. 

Human  factors  considerations  in  user  interface  design 

No  matter  how  powerful  the  set  of  design  tools  are  in  a  control  system  CAD  environment,  the  successful  use 
of  these  tools  will  depend  to  a  large  extent  on  the  quality  of  the  user  interface.  In  order  to  ensure  that 
the  user  interface  is  designed  in  such  a  way  that  human  errors  are  minimized,  it  is  necessary  for  the 
designer  of  the  interface  to  be  aware  of  and  sensitive  to  the  abilities  and  needs  of  users,  from  the  novice 
user  to  the  expert.  It  is  also  important  when  potential  users  of  CAD  systems  come  to  judge  competetive 
systems  for  selection  for  use  in  their  organisation,  that  they  too  are  aware  of  the  factors  which  will 
determine  to  a  great  extent  the  degree  of  acceptance  and  benefit  which  will  be  achieved  in  respect  of  each 
system  due  to  easy  of  use,  freedom  from  human  error,  etc. 

A  recent  paper  by  Norman  (8  )  classifies  the  types  of  human  error  which  can  occur  in  the  use  of  any  computer 
system,  including  a  CAD  system  and  analyses  the  relationship  of  these  errors  to  a  set  of  derived  design 
guidelines  for  the  user  interface.  In  the  remainder  of  this  paper,  we  will  review  those  various  classes  of 
human  error  and  relate  them  to  the  characteristics  and  components  of  the  proposed  integrated  design  support 
environment  discussed  earlier  in  the  paper  using  the  DELIGHT  environment  to  demonstrate  various  aspects  where 
appropriate. 

Based  on  Norman's  analysis,  we  can  classify  the  classes  of  human  error,  in  this  case  errors  in  carrying  out 
a  given  intention  rather  than  error  in  formulating  the  intention,  as  follows. 

Mode  errors  -  these  are  errors  which  result  from  the  user  misunderstanding  the  current  state  of  the  system 
and  as  a  result  carrying  out  an  action  appropriate  to  a  mode  of  operation  other  than  the  one  currently 
applying.  This  implies  the  need  either  to  eliminate  different  modes  entirely  or  to  make  the  modes  highly 
distinctive.  The  first  alternative  is  difficult  to  accomplish  fully  in  a  CAD  environment  which  has  to 
accomplish  a  wide  range  of  complex  tasks.  Therefore  it  would  appear  that  the  preferred  solution  is  to 
abolish  different  modes  where  they  are  unnecessary  and  to  make  the  modes  highly  distinctive  when  they  have  to 
be  incorporated.  The  implication  for  the  CAD  environment,  clearly  exemplified  by  DELIGHT,  is  that  as  far  as 
possible  the  user  should  communicate  his  design  procedures  to  the  CAD  system  using  a  single  language,  for 
issuing  commands  to  create  and  modify  the  system  description,  for  analysing  and  debugging  the  results, 
etc.  Where  it  is  unavoidable,  other  modes  can  be  introduced  of  a  highly  distinctive  nature,  for  example,  a 
graphical  high  level  design  language  for  describing  structural  or  topological  relationships  in  the  system, 
since  interaction  via  a  light  pen  or  mouse/tablet  combination  is  sufficiently  distinctive  to  avoid  mode 


errors.  Where  keyboard  interaction  is  envisaged  in  combination  with  and  as  part  of  the  graphical  interaction, 
this  must  be  made  either  sufficiently  distinctive  to  avoid  error,  or  must  be  identical  to  the  noi mal  form  of 
interaction  via  the  textually  based  design  language,  i.e.  the  latter  language  should  be  the  one  used. 

A  second  class  of  errors  is  generated  if  the  CAD  system  offers  a  range  of  very  similar  commands  which  can  be 
easily  confused  and  yet  lead  to  vastly  different  results.  A  good  example,  quoted  by  Norman,  of  this 
situation  is  provided  by  the  Berkeley/lJNIX  "vi"  screen  editor  where  each  of  the  letters  d,  f,  g  and  u  has  a 
different  meaning  if  typed  in  lower  case,  upper  case  or  in  combination  with  the  "control"  key.  Another 
example  which  is  very  common  is  related  to  the  confusion  which  can  occur  if  closely  related  arguments  in  a 
command  can  be  wrongly  ordered  with  possible  disasterous  consequences,  as  in  a  file  "copy"  command  common  in 
many  systems,  especially  if  the  source  file  overwrites  the  destination  file  without  adequate  checks,  e.g.  that 
the  source  file  is  not  empty.  The  implications  for  the  design  of  CAD  environments  is  broadly  to  make  commands 
distinctive  and  where  arguments  occur  in  commands  to  separate  similar  arguments  and  provide  consistency 
checking.  This  is  especially  difficult  to  ensure  however  in  an  environment  such  as  DELIGHT  which  allows  the 
user  to  create  new  commands,  with  arguments  if  required.  The  rules  here  then  apply  to  the  user  as  well  as 
the  initial  designer  of  the  CAD  environment. 

A  further  class  of  errors  results  from  lack  of  consistency  when  a  user  attempts  to  use  experience  to  derive  a 
new  command  sequence  and  the  correct  form  for  the  new  sequence  is  inconsistent  with  that  prior  experience. 

The  requirement  here  is  for  the  language  structure  used  in  the  CAD  environment  to  be  consistent  within  its 
own  context,  but  also  with  languages  which  the  user  may  already  be  familiar  with.  This  is  achieved  to  great 
effect  in  DELIGHT  where  the  design  language  RATTLE  is  highly  consistent  with  RATKOR  and  other  block  structured 
languages.  Since  RATFOR  is  the  language  in  which  procedures  can  be  written  which  are  callable  from  RATTLE, 
it  is  additionally  likely  that  the  user  will  not  suffer  from  lack  of  consistency  in  moving  from  one  to  the 
other. 

If  a  command  in  the  environment  is  formed  by  the  simple  modification  of  another,  this  can  also  lead  to  a 
further  class  of  errors  especially  if  the  unmodified  command  is  used  much  more  frequently  than  the  other. 

In  this  case,  the  error  which  occurs  is  that  in  the  infrequent  case  in  which  the  user  wishes  to  issue  the 
modified  command,  he  nevertheless  issues  the  unmodified  one  due  to  an  unconscious  reflex  born  from  familiarity. 

The  lesson  to  be  learned  here  is  not  to  make  commands  "overlap"  in  this  way,  but  as  in  the  earlier  case,  make 

all  commands  distinct.  A  possible  source  of  error  of  this  class  exists  in  DEUGHT  in  the  commands  "echo— oi— to" 
and  "echo-to",  although  not  with  disasterous  results. 

Finally,  a  major  class  of  errors  are  so-called  "activation"  errors,  i.e.  errors  resulting  from  either  the 
activation  of  an  inappropriate  action  or  failure  to  activate  an  appropriate  action.  The  first  type  of 
activation  errors  occur  if  the  CAD  system  presents  misleading  information,  e.g.  inaccurate  error  reports  or 
ambiguous  status  reports,  causing  the  user  to  take  a  path  of  action  inappropriate  to  the  real  condition.  This 
clearly  implies  the  need  for  accurate,  unambiguous  display  of  information  by  the  system.  However,  since 
errors  of  this  type  are  unavoidable  to  some  extent,  especially  for  novice  users,  it  is  important  to  allow  the 

user  to  "undo"  the  sequence  of  inappropriate  actions  once  this  has  become  apparent,  or  at  least  be  able  to 

view  a  record  of  the  sequence  and  take  independent  action  to  return  to  the  initial  condition.  Hence  the 
requirement  for  the  "history"  facility  in  the  CAD  environment. 

This  facility  is  also  desirable  to  cope  with  the  second  type  of  activation  error,  failure  of  the  user  to 
activate  an  appropriate  action.  This  generally  results  from  memory  failure  occurring  between  the  time  when 
the  intention  to  carry  out  an  operation  is  formed  and  its  final  execution.  The  use  of  a  "history"  list 
permits  the  user  to  refresh  his  memory,  especially  when  the  occurence  of  intermediate  actions,  e.g.  graphical 
output,  listing  of  a  file,  has  obliterated  the  normal  terminal  record  of  previous  actions.  Alternatively,  or 

in  addition,  the  use  of  multiple  windows  on  the  display  screen  of  the  terminal  can  be  used  to  great  effect  to 

overcome  this  type  of  error,  e.g.  by  maintaining  separate  windows  for  graphics,  listing  files,  etc.  and  for 
command  sequences,  allowing  the  windows  to  overlap,  reappear,  etc.  as  required  by  the  user. 

In  this  section  of  the  paper  we  have  given  a  review  of  some  of  the  causes  of  human  error  in  using  CAD  systems. 
It  is  clear  that  the  design  of  the  environment  proposed  in  this  paper  must  incorporate  a  proper  consideration 

of  these  factors  and  incorporate  features  in  the  user  interface  design  which  will  avoid  or  minimise  the 

occurence  of  human  error  as  far  as  possible.  These  features  can  be  broadly  classified  as  (8)* 

feedback  -  an  unambiguous,  accurate  and  readily  available  display  of  information  on  the  current  state 

of  the  system 

distinctive  modes-  a  provision  for  the  user  to  clearly  distinguish  between  commands  or  other  language 
and  actions  structures  or  modes  of  use  vf  the  system. 

consistency  -  an  adherence  of  all  parts  of  the  system  to  a  conrnon  command  or  language  structure  and  the 

close  relationship  to  this  to  other  language  structures  likely  to  be  familiar  to  the  user 

reversibility  -  an  ability  for  all  actions,  where  ever  possible,  to  be  reversed,  and  failing  this  a 
provision  of  obstacles  to  the  initiation  of  irreversible  actions. 

Conclusions 

In  this  paper  we  have  attempted  to  portray  something  of  a  vision  of  the  future.  The  present  state  of 
development  of  control  system  CAD  tools  is  such  that,  after  a  long  period  of  refinement  of  the  methods  of 
design  and  the  more  recent  attention  to  the  efficiency  and  reliability  of  the  implementation  of  these 
tools,  we  must  now  pay  careful  attention  to  the  total  environment  in  which  the  design  tools  are  to 
be  used  by  the  design  engineer.  This  requires  consideration  of  the  set  of  software  components  that  this 
environment  must  incorporate  over  and  above  the  basic  design  tools,  and  of  the  user  interface  which  the 
environment  presents,  from  the  viewpoint  of  increasing  the  ease  of  use  of  the  design  tools,  in  particular 
providing  freedom  from  unnecessary  human  error. 

We  expect  that  integrated  design  support  environments  of  the  'ype  envisaged  here  will  appear  over  the 


next  few  years  in  greater  number  and  with  a  greater  level  of  sophistication  than  hitherto  and  lead  to  a 
greater  willingness  on  the  part  of  the  industrial  control  system  designer  to  make  full  use  of  the  powerful 
design  methods  which  are  now  available. 
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Abstract 

Systematic  methods  for  design  of  control  systems  require  mathematical  models  of  the 
dynamics  of  processes  and  disturbances.  This  lecture  is  an  overview  of  techniques  for 
obtaining  such  models.  Modeling  from  first  principles  and  modeling  from  data  are 
discussed.  Particular  emphasis  is  given  to  computer  aided  tools  for  obtaining  and 
verifying  the  models.  Two  interactive  software  packages  Simnon  for  nonlinear  simulation 
and  Idpac  for  data  analysis  and  ident i f icat ion  are  described.  The  paper  ends  by  some 
speculation  on  the  tuture  trends. 


1.  INTRODUCTION 

The  design  of  a  control  system  is  frequently  divided  into  two  steps!  determination 
of  a  mathematical  model  and  design  of  a  control  strategy.  In  fact  most  of  the  control 
theory  that  has  been  developed  postulates  that  a  model  of  the  system  and  its  environment 
is  available.  To  use  much  of  the  existing  control  theory  it  is  therefore  necessary  to 
have  techniques  to  determine  suitable  models  for  the  processes  to  be  controlled. 

Before  the  advent  of  modern  control  theory  most  results  were  restricted  to  linear 
systemsi  assuming  a  model  specified  by  a  transfer  function.  The  modeling  then  reduces  to 
the  determination  of  transfer  functions.  This  is  conveniently  done  experimentally  by 
introducing  a  sinusoidal  variation  in  the  input  and  measuring  amplitude  and  phase  between 
the  input  and  the  output. 

A  characteristic  feature  of  many  significant  results  in  control  theory  that  have 
been  developed  over  the  past  30  years  is  that  they  require  other  models  than  transfer 
functions.  Typically .  many  of  the  results  of  modern  theory  assume  that  the  system  is 
described  by  time  domain  models  like 

dx 

--  =  f  <x.  u»  v) 


<1) 

y  =  g  <  x  »  u>  v  > 

where  u  is  the  inputi  y  the  output!  x  the  state  variable  and  v  a  disturbance. 

To  apply  the  results  of  modern  control  theory  it  is  therefore  necessary  to  have 
techniques  available  to  determine  models  like  Eq .  <1).  In  this  paper  we  will  outline  some 
progress  towards  the  solution  of  this  problem. 


2.  MODELS  AND  MODELING 

The  notion  of  a  mathematical  model  is  fundamental  to  science  and  engineering*.  A 
model  is  a  very  useful  and  compact  way  to  summarize  our  knowledge  about  a  process.  A 
model  is  also  a  very  effective  tool  for  education  and  communicat ion.  For  control 
engineering!  models  are  significant  because  virtually  all  existing  control  theory  is 
based  on  the  assumption  that  mathematical  models  of  the  process'  its  environment  and  the 
criterion  are  available.  The  models  are  also  used  to  select  the  structure  of  the  control 
systemi  appropriate  sensors  and  actuators.  They  are  also  useful  for  process  design. 

It  is  important  to  emphasize  the  danger  of  believing  that  a  process  can  be 
cha racteri zed  by  one  mathematical  model.  It  is  much  more  fruitful  to  represent  a  process 

with  a  bierarghy_gf _ models i  ranging  from  very  detailed  and  complex  simulation  models  of 

whole  processes  to  the  ’back  of  an  envelope  model’  which  is  easily  manipulated 
analyt ical ly .  The  simple  models  are  used  for  exploratory  purpose  and  to  obtain  the  gross 
features  of  the  system  behaviour.  The  very  complicated  simulation  models,  which  also  may 
contain  pieces  of  the  real  process,  are  used  for  a  detailed  check  of  the  control  system 
to  make  sure  that  nothing  has  been  neglected.  The  complicated  models  take  a  long  time  to 
develop  and  they  are  costly  to  maintain.  They  do.  however,  reproduce  the  properties  of 
the  real  system  with  high  fidelity  and  they  are  a  necessity  for  design  of  critical 
processes.  Between  the  two  extremes  there  may  be  many  different  types  of  models  which  are 
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suited  for  design  of  control  systems.  The  crucial  problem  is  to  steer  between 
overs  itvipl  i  f  icat  ion  with  the  danger  of  disaster  and  ove  rcomplication  which  is  too 
expensive.  The  trademark  of  good  engineering  is  to  choose  the  right  model  for  each 
specific  purpose. 

There  are  in  principle  two  different  sources  from  which  models  can  be  obtained)  from 
prior  experiences  in  terms  of  physical  lawsi  (modeling  from  physics'  or  by 
experimentation  on  a  process  ' identification).  When  it  is  attempted  to  obtain  a  specific 
model  it  is  of  course  beneficial  to  combine  both  approaches. 

Classical  control  theory  was  based  on  the  idea  to  model  a  dynamical  system  by  a 
transfer  function  or  an  impulse  response.  Such  a  model  is  referred  to  as  an 
§><ternai_descr ipt ion  or  a  b 1 ack ;box _mode 1  because  it  gives  only  a  relation  between  the 
system  input  and  output  i.e.  the  external  variables.  The  success  of  the  classical  control 
theory  can  partly  be  attributed  to  the  fact  that  there  were  powerful  experimental 
techniques)  frequency  and  transient  response  analysis)  which  made  it  possible  to  obtain 
the  appropriate  models.  These  classical  methods  for  system  l dent l f icat i on  are  still  very 
useful  for  process  modeling  and  they  should  always  be  kept  in  mind  even  if  they  have 
largely  vanished  from  most  current  papers  on  automatic  control. 

The  so-called  modern  control  theory  is  largely  based  on  a  process  model  in  terms  of 
a  state-equat i on .  This  is  called  ah  i nternal _model.  or  a  white-bgx_mgdei  because  the  state 
model  describes  explicitly  all  the  internal  couplings  between  the  inputs)  outputs  and  the 
state-variables.  The  problem  of  obtaining  suitable  internal  descriptions  for  different 
process  is  one  of  the  major  problems  in  applications  of  modern  control  theory.  In 
aerospace  applications  the  desired  models  can  sometimes  be  derived  from  basic  physical 
laws.  When  this  is  not  possible  process  it  is  necessary  to  use  experiments.  Much  of  the 
current  research  in  system  ident i f icat ion  has  been  inspired  by  the  desire  to  obtain 
process  models  from  process  experiments. 

In  control  system  design  it  is  also  important  to  have  models  of  disturbances.  The 
external  models  are  often  given  in  terms  of  spectral  densities  and  covariance  functions. 
When  using  internal  representations  the  disturbances  are  instead  represented  as  outputs 
of  dynamical  systems  driven  by  white  noise.  Models  for  disturbances  can  only>  rarely)  be 
determined  from  first  principles.  Process  experiments  combined  with  system  identification 
is  thus  often  the  only  possibility  to  model  disturbances. 

3.  MODELING  FROM  PHYSICS 

The  required  models  can  in  principle  be  derived  from  basic  physical  laws  expressing 
conservation  of  mass)  momentum  and  energy)  combined  with  material  equations  like  Boyle’s 
law  or  Hooke’s  law.  The  models  obtained  in  this  way  have  the  advantage  of  a  wide  range  of 
validity.  Usually)  they  also  provide  a  good  insight  into  the  behaiviour  of  the  system. 
The  drawbacks  of  modeling  from  physical  laws  are  that?  the  required  knowledge  is  not 
always  available!  the  modeling  is  frequently  time-consuming  (consider  the  required  to 
develop  Newtonian  mechanics) i  and  it  is  often  difficult  to  make  sensible  approx imat ions . 
A  typical  difficulty  is  to  find  good  approx imat ions  of  distributed  parameter  systems. 
Experience  has  shown  that  the  models  developed  from  basic  physical  principle  tend  to  be 
complex.  A  complex  model  indirectly  implies  a  complex  control  strategy  and  vice  verse.  If 
a  system  can  be  successfully  controlled  by  a  simple  strategy  it  can  probably  also  be 
modeled  sat isfacor i 1 y  by  a  simple  model.  In  the  area  of  flight  control  systems  rigid  body 
dynamics  is  well  understood  in  the  sense  that  models  can  conveniently  be  derived  from 
physical  principles.  The  models  are  also  supported  by  experimental  data  from  wind  tunnel 
experiments.  On  the  other  hand  phenomena  like  flutter)  ae roelast ici ty  and  large  angle  of 
attach  behavior  are  not  sufficiently  well  known  for  models  suitable  for  control  to  be 
derived  from  physical  principles  alone. 

The  problem  of  obtaining  a  model  like  Eq .  (1)  For  a  dynamical  system  is  sometimes 
called  the  inverse  problem  because  a  solution  is  given  and  the  problem  is  to  find  the 
equation  which  has  the  given  solution.  Problems  of  this  type  do  of  course  arise  in  many 
fields:  biolqy)  medicine)  economy)  physics  and  chemistry.  There  are  certain  advantages  in 
the  modeling  and  ident i f icat ion  problems  originating  from  the  field  of  automatic  control. 
There  is  a  specific  purpose  in  doing  the  modeling  (design  of  control  strategies).  It  is 
often  fairly  easy  to  do  experiments.  (Control  systems  are  designed  in  such  a  way  that 
control  variables  can  be  manipulated  and  outputs  measured.) 

Strictly  speaking)  the  problem  may  hot  be  so  well  defined.  Even  if  the  design  of  a 
control  system  is  the  final  goal)  it  is  of  course  valuable  to  have  insight  and 
understanding  of  system  properties  which  do  not  enter  the  control  design  directly.  The 
possibility  of  making  experiments  may  be  limited  because  it  may  be  necessary  to 
experiment  under  normal  operating  conditions)  and  large  changes  in  inputs  may  be 
prohibitive  for  safety  and  economic  reasons. 

Process  models  can  be  obtained  from  basic  physical  laws)  from  pure  input-output 
experiments  or  from  a  combination  of  these  approaches.  Determination  of  a  model  from 
input-output  measurements  only  has  the  advantage  of  being  done  quickly.  Experience  has 
also  shown  that  it  usually  leads  to  fairly  simple  models.  One  serious  disadvantage  is 
that  in  most  methods  it  is  possible  to  determine  linear  models  only.  This  means  that  the 
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validity  of  the  model  is  limited.  A  change  in  operating  conditions!  input  signals!  etc.! 
may  thus  lead  to  a  different  model.  Another  disadvantage  is  that  available  a  priori 
knowledge  is  not  used.  For  example!  it  is  almost  impossible  to  exploit  a  priori  knowledge 
when  a  transfer  function  is  determined  using  frequency  response  methods.  Noweveri  in  som 
cases  the  input-output  approach  may  be  the  only  possibility.  This  may  be  the  case  in  the 
cha racte ri zat ion  of  disturbances  such  as  load  variations.  Recognizing  the  advantages  and 
d isadvantages  of  modeling  from  physical  equations  and  from  input-output  experiments  alone 
it  seems  highly  desirable  to  try  to  exploit  both  methods  in  order  to  solve  the  modeling 
problem. 


4.  MODELING  FROM  DATA 

Techniques  for  determining  dynamic  models  from  experimental  data  is  called 
ident if icat ion  in  the  control  engineering  literature.  A  brief  review  is  given  in  this 
section.  For  more  details  we  refer  to  the  survey  papers  Cl]  and  C2]  i  the  books  C3]  i  C4]  i 
t51 i  C6]  and  the  proceedings  from  the  IFAC  Symposia  on  system  identification  in  Prague 
1967!  1970i  the  Hague  1973i  Tbilisi  1975i  Darmstadt  1979  and  Washington  1982. 

It  was  mentioned  in  the  introduction  that  ident i f icat ion  was  the  experimental  aspect 
of  process  modeling.  In  particular  system  ident i f icat ion  includes 

1.  Experimental  planning 

2.  Selection  of  model  structure 

3.  Parameter  estimation 

4.  Validation 

Experimental  planning  includes  the  decision  to  make  open-  or  closed-loop  experiments! 
selection  of  input  signals  and  sampling  rates.  It  also  includes  considerations  of  many  of 
the  practical  problems  that  are  associated  with  experiments  in  an  industrial  environment. 
The  experiment  will  result  in  data  D  in  the  form  of  records  of  inputs  and  outputs  from 
the  process.  The  selection  of  model  structure  is  frequently  based  on  physical  principles 
or  on  a  priori  knowledge  of  the  process  dynamics.  The  purpose  of  the  parameter  estimation 
is  to  determine  the  parameters  of  the  model  based  on  the  experimental  data.  Model 
validation  is  the  procedures  used  to  ensure  that  the  model  obtained  is  reasonable.  This 
frequently  requires  more  experiments. 

In  practice  the  procedure  is  iterative.  When  i nvest igat i ng  a  process  where  the  a 
priori  knowledge  is  poor  it  is  reasonable  to  start  with  transient  and  frequency  response 
analysis  to  get  crude  estimates  of  the  dynamics!  the  region  of  linearity!  and  the 
disturbances.  Based  on  these  results  it  can  then  be  attempted  to  derive  physical  models 
where  the  results  of  the  frequency  response  analysis  are  used  to  guide  various 
approximations.  The  results  of  the  preliminary  investigation  can  then  be  used  to  plan 
suitable  experiments  where  the  plant  is  perturbed  and  the  output  observed.  The  data 
obtained  are  then  used  to  estimate  the  unknown  parameters.  New  experiments  are  done  for 
the  validation.  Based  on  the  results  and  the  experience  obtained!  the  model  may  be 
improved  and  new  experiments  can  be  planned  etc. 

The  different  phases  of  system  identif ication  will  now  be  discussed  in  more  detail. 


iSB§I!im®Dtai_glanning 

It  is  often  difficult  and  costly  to  perform  experiments  on  real  processes.  It  has 
therefore  been  a  desire  to  develop  methods  that  will  relax  the  constraints  on  the 
experiments  at  the  expense  of  increased  computat ions .  While  many  classical  methods 
depended  strongly  on  the  input  to  have  a  precise  form  the  newer  techniques  can  handle 
virtually  any  type  of  input  signal.  The  only  requirements  on  the  input  signal  is  that  it 
should  excite  all  the  modes  of  the  process  sufficiently  (persistent  excitation). 

There  is  a  substantial  literature  on  the  planning  of  statistical  experiments  C7D ! 
C81 .  The  purpose  is  to  find  optimal  designs  of  experiments.  In  process  modeling  this 
corresponds  to  finding  optimal  input  signals.  Considerable  research  has  been  devoted  to 
this  problem  C9] i  C101.  All  results  on  optimal  input  design  arei  however!  based  on  the 
assumption  that  a  model  of  the  process  is  known.  This  means  that  the  results  can  only  be 
used  when  a  reasonably  good  a  priori  knowledge  of  the  dynamics  of  the  process  and  its 
environment  is  available.  Good  applications  to  determination  of  aircraft  flight  dynamics 
and  ship  steering  dynamics  are  known.  The  results  may!  howeveri  also  be  strongly 
misleading  if  the  process  dynamics  differ  from  the  a  priori  assumptions.  The  results  on 
design  of  optimal  inputs  are  also  restricted  because  it  is  frequently  assumed  that  the 
process  is  open  loop  during  the  experiment. 

The  possibility  to  base  system  ident i f icat ion  on  data  obtained  under  closed  loop 
control  of  processes  has  been  explored  recently  Ill].  The  results  obtained  are  very 
useful  from  the  point  of  view  of  appl icat ions .  The  main  difficulty  with  data  obtained 
from  a  process  under  feedback  is  that  it  may  be  impossible  to  determine  the  desired 
models  i.e.  lack  of  ident i f iabi 1 ity .  It  has!  howeveri  been  demonstrated  that 
identif iabil ity  can  be  recovered  if  the  feedback  is  sufficiently  complex.  It  helps  to 
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make  the  feedback  nonlinear*  time-varying  and  to  change  the  set  points-  A  practical  way 
to  make  the  feedback  t ime-va riabl e  is  to  switch  between  different  linear  feedbacks.  There 
are  cases  where  data  from  closed  loop  experiments  will  give  better  results  than  open  loop 
experiments  C113.  A  practical  way  to  arrange  the  experiments  is  to  provide  the  process 
with  a  self-tuning  regulator  to  minimize  the  fluctuations  in  process  variables  and  to 
change  the  set-point  of  the  regulator  with  as  large  signal  as  passible. 


Model  structures 

The  model  structures  used  are  derived  from  prior  knowledge  of  the  dynamics  of  the 
process  and  its  environment. 

In  some  cases  the  only  a  priori  knowledge  available  is  that  the  process  can  be 
described  as  a  linear  system  in  a  parcticular  operating  range.  It  is  then  natural  to  use 
general  representations  of  linear  systems  i . eg .  black  box  model.  Typical  examples  of 
black  box  models  are  the  transfer  function  model 

U(s)  =  G(s)U(s)  +  H(s)Ets)  (.2) 


and  the  difference  equation  model 

Yl't)  +  A  y(t-l)  +...+  A  y(t-n)  = 

1  n 

=  B  uCt-li  +  ...+  B  uCt-n)  +  e<t)  +  C  e(t-l>  +  ...+  C  eCt-n) 
1  u  1  u 


(3) 


where  u  is  the  input*  y  the  output  and  e  is  a  white  noise  disturbance.  The  parameters  as 

well  as  the  order  n  in  the  vector  difference  Eq.  C3>  are  considered  as  unknown 
parameters. 

Sometimes  it  is  possible  to  apply  known  physical  laws  to  derive  models  of  the 
process  which  anly  contain  a  few  parameters.  For  lumped  parameter  processes  such  white 
box  models  may  be  of  the  form 
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where  u  is  the  input*  y  the  output*  x  the  state*  e  and  v  disturbances  and  0  a  vector  of 
unknown  parameters.  Linear  models  with 


f  (  X  *  u  * 

v*  9; 

=  A  C  9 )  x 

+  B(0)u  +  v 

g  <  x  *  u  * 

e*  0  j 

=  c  <:  e )  x 

+  D  (  0 )  u  +  e 

C5J 


are  particularly  common. 

For  distributed  parameter  processes  the  model  given  by  Eq .  (4)  is  replaced  by  a  partial 

differential  equation. 

In  many  practical  cases  the  models  may  be  composed  of  parts  which  are  black  box  models 
and  parts  which  are  white  box  models.  Such  models  are  called  grey_box_mgdels .  Notice  that 
a  significant  trend  in  the  recent  development  is  to  attempt  to  model  both  the  process 
dynamics  and  the  disturbances.  This  is  of  course  in  close  agreement  with  the  needs  of  the 
control  engineer  because  without  disturbances  there  is  no  control  problem. 


Criteria 

When  formulating  an  ident l f icat ion  problem  a  criterion  is  introduced  to  give  a 
quantity  expressing  how  well  a  model  M  fits  the  experimental  data  D.  The  criteria  can  be 
oostulated.  By  making  statistical  assumptions  it  is  also  possible  to  derive  criteria  from 
probalistic  arguments.  Criteria  can  therefore  be  viewed  from  two  points  of  view.  They  are 
often  expressed  as 
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or  for  discrete  time  systems 


v<e)  =  2  hrt:,J  <7> 

t=0 

where  e  is  the  input  erron  the  output  error  or  the  generalized  error.  See  Ell.  The 
prediction  error  is  a  typical  example  of  a  generalized  error.  The  function  h  is 
frequently  chosen  as  a  quadratic  but  it  is  also  possible  to  have  many  other  forms. 

Particularly  it  may  be  useful  to  have  functions  which  do  not  grow  as  rapidly  as  e~  for 
large  e.  See  C12]  . 

The  first  formulation!  solution  and  application  of  an  ident i f icat ion  problem  was 


given  by  Gauss  in  his  famous  determination  of  the  orbit  of  the  planet  Ceres  C131.  Gauss 
formulated  the  ident i f icat ion  problem  as  an  optimization  problem  and  introduced  the 
principle  of  least  squares.  Ever  since!  the  least  squares  criterion  has  been  used 
extensively.  Nowadays  the  least  squares  method  <LS)  commonly  refers  to  a  method  where  not 
only  the  criterion  is  quadratic  but  also  the  model  is  such  that  the  errors  (i.e.  the 

differences  between  the  observed  and  computed  values)  are  iinear_in the_pa rameters .  The 

solution  of  the  problem  can  then  be  given  in  closed  form.  It  should!  howeven  always  be 
remembered  that  least  squares  is  often  chosen  for  mathematical  convenience. 

Because  of  the  simplicity  of  the  least  squares  problem  it  is  always  the  tempting  to 
use  this  formulation.  It  isi  howeven  useful  to  remember  that  if  the  identification 
problem  is  solved  using  a  digital  computer  there  is  no  particular  reason  to  choose  a 
quadratic  criterion.  When  the  disturbances  of  a  process  are  described  as  stochastic 
processes!  the  identification  problem  can  be  formulated  as  a  statistical  parameter 
estimation  problem  and  the  whole  artillery  of  statistical  estimation  methods  become 
available.  The  maximum  likelihood  method  is  a  popular  technique  which  has  many  attractive 
statistical  properties.  See  e.g.  C141i  C151  and  C1&].  This  method  can  also  be  interpreted 
as  a  least  squares  criterion  if  the  quantity  to  be  minimized  is  taken  as  the  sum  of 

squares  of  the  prediction _ errors  or  more  precisely  in  the  case  of  discrete  time 

observations  at  times  t  i  t  i...it  the  criterion  is  given  by 

0  1  N 

N 

1  r  T  -1  Np 

V ( 0 )  =  N/2  log  det  R  +  -  /  e  <t';R  eCt  )  —  log  211  (3) 

2  L  i  i  2 

1 

where  eft  )  are  the  prediction  errors 


elt  )  =  yCt  )  -  y Ct  |t  )  C9) 
i  l  i  l-l 

The  maximum  likelihood  criterion  Eq .  (8)  is  based  on  the  assumption  that  the  prediction 
errors  e  are  normally  distributed.  Notice!  howeven  that  the  criterion  Eq .  (9)  can  still 
be  postulated  even  if  the  prediction  errors  are  not  normal.  The  corresponding 
identification  method  then  becomes  a  prediction  error  method. 


Eajcams£sx_ssii!i!3fei9D_'i!Sib9£[s 

The  parameter  estimation  paroblem  can  be  formulated  as  follows.  Given  data  D!  in  the 
form  of  input-output  records  from  a  process!  class  of  models  M  and  a  criterion  C.  Find  a 
model  in  the  class  M  which  fits  the  data  as  well  as  possible  according  to  the  criterion 
C.  There  are  many  possibilities  to  combine  experimental  conditions!  model  classes  and 
criteria.  There  are  also  many  different  ways  to  organize  the  calculations  of  the 
est  imate.  Consequently  there  is  a  large  number  of  different  identification  methods 
available.  It  is  useful  to  remember!  however i  that  they  are  all  based  on  the  same 
principle  and  that  they  only  differ  in  the  choice  of  model  structures!  criteria  and 
organization  of  the  ca leu lat ions . 

One  broad  distinction  is  between  on-^ine  methods  and  off-line  methods.  The  on-line 
methods  give  estimates  in  real  time  as  the  measurements  are  obtained.  The  on-line  methods 
are  the  only  alternative  if  parameters  are  timevarying  and  when  the  estimates  must  be 
produced  in  real  time.  The  on-line  algorithms  available  are  also  frequently  simpler  to 
program  than  the  off-line  methods.  The  draw-back  with  the  on-line  methods  is  that  they 
are  less  reliable.  They  may  not  necessarily  converge.  Even  if  they  converge  they  may 
converge  to  the  wrong  solution.  In  many  cases  the  off-line  methods  will  also  give 
est  imates  with  higher  precision.  Off-line  techniques  are  therefore  preferrable  unless  the 
processes  are  timevarying  or  it  is  necessary  to  obtain  estimates  in  real  time. 
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Choice_g£_methgds 

The  large  number  of  identif ication  methods  available  are  of  course  very  confusing 
for  an  industrial  engineer  who  is  primarily  interested  in  having  a  tool  to  obtain  a 
model.  Several  attempts  to  compare  different  ident 1 f icat ion  methods  have  also  been  made. 
See  e.g.  C173  and  C193.  The  comparisons  are  largely  inconclusive  in  the  sense  that  there 
is  no  method  that  is  universally  best.  Fortunately  it  appears!  however!  that  the  choice 
of  techniques  is  not  crucial.  Personally  I  would  recommend  a  prospective  user  to  learn 
the  classical  methods  (frequency  and  transient  response  analysis!  correlation  and 
spectral  analysis)!  least  squares  with  extensions  and  maximum  likelihood.  The  least 
squares  method  is  very  simple  and  easy  to  understand.  Under  some  circumstances  it  will 
give  estimates  with  the  wrong  mean  values  (bias).  This  cam  howeveri  be  overcome  by  using 
various  extended  least  squares  and  generalized  least  squares.  The  major  drawback  by  least 
squares  is  that  it  requires  a  model  structure  which  is  linear  in  the  parameters.  The 
maximum  likelihood  method  is  a  very  general  technique  which  can  be  applied  to  a  wide 
variety  of  model  structures. 

Mgdei_vaiidat ign 

When  a  model  has  been  obtained  from  experimental  data  it  is  necessary  to  check  the 
model  in  order  to  reveal  its  inadequacies .  Black  box  models  should  be  given  particular 
attention  in  this  respect.  For  model  validation  it  is  useful  to  determine  step  responses! 
impulse  responses!  poles  and  zerosi  model-  and  prediction  errors  etc.  Calculation  of 
statistical  quantities  like  correlation  of  prediction  errors  and  cross  correlations 
between  inputs  and  prediction  errors  can  also  be  revealing.  Since  the  purpose  of  the 
model  validation  is  to  scrutinize  the  model  with  respect  to  inadequacies  it  is  useful  to 
look  for  quantities  that  are  sensitive  to  model  changes. 
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where  V  is  the  loss  function  (e.g.  the  negative  logarithm  of  the  likelihood  function)  of 
i 

the  model  having  p  parameters  and  N  the  number  of  sampling  points.  The  model  with  more 
i 

<p  >  parameters  is  preferred  if  the  value  t  is  sufficiently  large. 

2 

An  interesting  approach  to  this  problem  has  recently  been  given  by  Akaike  Cl 93  who 
suggests  using  the  criterion 

AIC  =  -  2'  log  (ML)  t  2p 
(11) 

where  ML  is  the  maximum  likelihood  and  p  is  the  number  of  parameters.  Akaike’ s  criterion! 
which  is  based  on  information  theoretic  considerations!  is  equivalent  to  Eq .  (10)  if  V 

is  close  to  V  .  Other  tests  are  given  in  CS3 . 

z 

When  the  ident i f ica t ion  problem  is  formulated  as  a  statistical  parameter  estimation 
problem!  there  are  many  ideas  and  results  from  statistics  that  can  be  exploited.  For 
example  it  is  passible  to  assign  accuracies  to  the  parameter  estimates  by  using  the 
second  derivative  of  the  likelihood  function.  The  statistical  approach  requires!  however! 
that  certain  assumptions  are  made  on  the  mechanism  which  generated  the  data  i.e.  the  real 
process.  This  is  most  unpleasant  because  the  real  process  is  often  nonlinear! 
t imevaryingi  and  infinite  dimensional  and  little  is  known  about  it. 

Great  care  should  therefore  be  used  when  the  results  of  statistical  analyses  are 
interpreted.  It  has  been  found  empirically  that  many  methods  work  very  well  on  simulated 
data  but  very  poorly  on  real  data.  This  reflects  the  fact  that  certain  results  are 
sensitive  to  variations  in  the  data  generation  and  it  indicates  the  needs  for  research 
into  the  problem  of  mismatch  between  the  model  structure  and  the  data  generation.  A 
particular  problem  of  overfitting  clearly  illustrates  what  can  happen.  If  a  model  which 
has  too  many  parameters  is  fitted  to  a  given  data  set  an  extremely  good  fit  can  of  course 
be  obtained  for  a  particular  data  set.  The  high  order  model  mayi  howeveri  be  very  poor 
when  applied  to  another  data  set.  It  is  therefore  a  good  practical  rule  to  work  with  at 


least  two  data  sets.  One  set  is  used  for  the  identification  and  the  other  for  the 
val idat ion. 


5.  COMPUTER  AIDS 

It  is  a  substantial  effort  to  solve  a  system  identification  problem  for  an 
industrial  process  if  no  prior  experience  and  no  software  is  available.  The  effort  can  be 
reduced  substant ia l 1 y  if  good  computer  software  is  available.  In  particular  it  has  been 
our  experience  that  the  time  and  effort  can  be  reduced  substantial ly  if  suitable  software 
for  interactive  computing  is  available. 

Interactive  computing  requires  an  efficient  man-machine  interface.  A  graphical 
display  which  can  be  used  to  show  curves  is  a  necessity.  Interactive  software  allows  the 
problem  solver  to  combine  his  insight  and  intuition  with  extensive  calculation.  It  also 
gives  a  direct  link  between  the  problem  solver  and  the  computer  without  needing 
programmers  as  intermediaries. 


I  nte  ract  ign_or  i  nc  igj.es 

When  designing  a  system  for  man-machine  interaction  it  is  important  to  realize  that 
there  is  a  wide  range  of  users*  from  novices  to  exgertsi  with  different  abilities  and 
demands.  For  a  novice  who  needs  a  lot  of  guidance  it  is  natural  to  have  a  system  where 
the  computer  has  the  initiative  and  the  user  is  gently  led  towards  a  solution  of  his 
problem.  For  an  expert  user  it  is  much  better  to  have  a  system  where  the  user  keeps  the 
initiative  and  where  he  gets  advice  and  and  help  on  request  only.  Attempts  of  guidance 
and  control  by  the  computer  can  lead  to  frustration  and  inefficiency.  It  is  highly 
desirable  to  design  a  system  so  that  it  will  accomodate  a  wide  range  of  users.  This  makes 
it  more  universal.  It  also  makes  it  possible  to  gradually  shift  the  initiative  from  the 
computer  to  the  user  as  he  becomes  more  proficient. 

To  obtain  an  efficient  man-machine  interface  it  is  desirable  to  have  hardware  with  a 
high  communication  rate  and  a  communication  language  with  a  good  expression  power.  When 
our  projects  were  started  we  were  limited  to  a  teletype  and  a  storage  oscilloscope.  There 
were  also  limited  experiences  of  design  of  man-machine  interfaces.  The  predominant 
approach  was  a  quest ion-and-answer  dialog.  See  C203  . 

In  our  projects  it  was  discovered  at  an  early  stage  that  the  simple 
quest ion-and-answer  dialog  was  too  rigid  and  very  frustrating  for  an  experienced  user. 
The  main  disadvantage  is  that  the  computer  is  in  command  of  the  work  rather  than  the 
user.  This  was  even  more  pronounced  because  of  the  slow  input-output  device  (teletype) 
which  was  used  initially. 

Our  primary  design  goal  was  to  develop  tools  for  the  expert.  A  secondary  goal  was  to 
make  the  tools  useful  also  for  a  novice.  To  make  sure  that  the  initiative  would  remain 
with  the  user  it  was  decided  to  make  the  interaction  command  oriented.  This  was  also 
inspired  by  experiences  from  programming  in  APL.  Use  of  a  command  dialog  also  had  the 
unexpected  effect  that  it  was  passible  to  create  new  user  defined  commands  easily.  It  was 
thus  possible  to  use  the  packages  in  ways  which  were  not  anticipated  when  they  were 
designed.  The  decision  to  use  commands  instead  of  a  question  and  answer  dialog  thus  had 
far  reaching  consequences.  A  more  detailed  discussion  of  the  different  types  of  dialogs 
and  of  our  experiences  of  them  is  given  in  [211.  Today  there  is  a  wide  range  of 
experiences  of  designing  man-machine  interfaces  in  many  different  fields.  Our  own 
conclusions  agree  well  with  those  found  in  [221  and  C231  .  Conclusions  are  based  on 
different  hardware. 


E x amp les_of  commands 

The  structure  of  the  commands  we  introduced  will  now  be  described.  The  general  form 
of  a  command  is 

NAME  LARG1  LARG2 .  .  .  <-  RARG1  RARG2 .  .  . 

A  command  has  a  name.  It  may  also  have  left  arguments  and  right  arguments.  The 
arguments  may  be  numbers  or  names  of  objects  in  a  data  base.  In  our  packages  the  objects 
are  implemented  as  files  because  this  is  a  simple  way  to  deal  with  objects  having 
different  types.  A  few  examples  of  commands  are  given  to  further  illustrate  the  notion  of 
a  command.  The  command 

MAT  OP  S  «-  A  *  B  +  C 

simply  performs  the  matrix  operation  expressed  to  the  right  of  the  arrow. 

The  command 

POLOP  S  n  A  »  B  +  C 
performs  the  same  operation  on  polynomials. 
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The  c  Gulin  and 

INSI  U  100 
) PRBS  4  7 
>  EXIT 

generates  an  input  signal  of  length  100  called  U.  The  command  has  options  to  generate 
several  input  signals.  The  options  are  selected  by  additional  subcommands.  PRBS  is  a 
subcommand  which  selects  a  PRBS  signal.  The  optional  arguments  4  and  7  indicate  that  the 
PRBS  signal  should  change  at  most  every  fourth  sampling  period  and  that  its  period  should 

be  2^-1.  The  subcommand  EXIT  denotes  the  end  of  the  subcommands. 

The  command 

DETER  Y  <-  SYST  U 

generates  the  response  of  the  linear  system  called  SYST  to  the  input  signal  U. 

The  command 

ML  PAR  t-  DAT  N 

fits  an  ARMAX  model  of  order  N  to  the  data  in  the  file  called  DAT  and  stores  the 
parameters  in  a  file  called  PAR. 

The  command 

□  PTFB  L  CLSYS  «-  LOSS  SYS 

computes  the  optimal  feedback  gain  L  and  the  corresponding  closed  loop  system  CLSYS  for 
the  system  SYS  and  the  loss  function  LOSS. 


§bg  rt_£grm_cgmmands_and_de£au  l,t  _  values 

In  a  command  dialog  it  is  highly  desirable  to  have  simple  commands.  This  is  in 
conflict  with  the  requirement  that  commands  should  be  explicit  and  that  it  may  sometimes 
be  desirable  to  have  variants  of  the  commands.  These  opposite  requirements  may  be 
resolved  by  allowing  short  forms  of  the  commands.  The  standard  form  for  tVie  simulation 
command  is  SIMU.  If  no  other  command  starts  with  the  letter  S  it  is*  however*  sufficient 
to  type  S  alone.  It  may  also  be  useful  to  have  a  simple  way  of  renaming  the  commands.  Me 
have  experimented  with  short  farm  commands  and  renaming  mechanisms.  These  functions  are* 
however*  not  implemented  in  our  standard  packages. 

A  similar  mechanism  may  be  used  for  commands  which  use  arguments  by  introducing  a 
default  mechanism  so  that  previous  values  of  the  arguments  are  used  unless  new  values  are 
specified  explicitly.  The  concept  is  illustrated  by  an  example. 

The  syntax  diagram  for  the  command  SIMU  is  shown  in  Fig.  1.  The  diagram  implies  that 
any  form  of  the  .command  which  is  obtained  by  traversing  the  graph  in  the  directions  of 
the  arrows  is  allowed.  For  example  the  command 

SIMU  0  100 

simulates  a  system  from  time  0  to  time  100.  If  we  want  to  repeat  the  simulation  a  second 
time  with  different  parameters  it  suffices  to  write 

SIMU 

The  arguments  0  and  100  are  then  taken  as  the  previously  used  values. 

It  fallows  from  Fig.  1  that  start  and  stop  times  and  the  initial  time  increment  may 
be  specified.  It  is  also  possible  to  mark  curves  by  the  a  'qument  MARK.  A  simulation  may 
also  be  continued  by  using  the  end  conditions  of  a  prev! n.  simulation  as  initial  values. 
This  is  done  by  the  command  extension  C0NT.  The  results  of  a  simulation  may  also  be 
stored  in  a  file. 


Macros 

The  commands  are  normally  read  from  a  terminal  in  a  command  driven  system.  It  is* 
however*  useful  to  have  the  option  of  reading  a  sequence  of  commands  from  a  file  in 
storage  instead.  Since  this  is  analogous  to  a  macro  facility  in  an  ordinary  programming 
language  the  same  nomenclature  is  adopted.  See  C243 .  The  construction 


Figi_i  Syntax  diagram  for  the  command  SIMU. 

MACRO  NAME 
Command  1 
Command  2 
Command  3 

END 

thus  indicates  that  the  commands  li  2  and  3  are  not  executed  but  stored  in  memory.  The 
command  sequence  is  then  activated  simply  by  typing  NAME. 

Macros  are  convenient  for  simplification  of  a  dialog.  Command  sequences  that  are 

commonly  used  may  be  defined  as  macros.  A  simple  macro  call  will  then  activate  a  whole 

sequence  of  commands.  The  macro  facility  is  also  useful  in  order  to  generate  new 

commands.  Macros  may  also  be  used  to  rename  commands.  This  is  useful  in  order  to  tailor  a 

system  to  the  needs  of  a  particular  user. 

The  usefulness  of  macros  may  be  extended  considerably  by  introducing  commands  to 
control  the  program  flow  in  a  macro*  facilities  for  handling  local  and  global  variables 
and  by  allowing  macros  to  have  arguments.  By  having  commands  for  reading  the  keyboard  and 
for  writing  on  the  terminal  it  is  also  possible  to  implement  menu  driven  dialogs  using 
mac  ros . 

An  interactive  CAD  program  based  on  a  command  dialog  with  a  macro-facility  may  be 
viewed  as  an  extendab le_hugh_ .1  evel_g rgb ,1  em_so W i_ng_ language . 

It  is  important  in  interactive  systems  to  have  test  for  avoiding  errors.  It  is  thus 
useful  to  check  data  types  and  to  test  problems  for  consistency  whenever  possible. 

liSBigOSUTation 

It  is  st ra ight forwa rd  to  implement  a  command  driven  interactive  program.  The 
structure  used  in  all  packages  is  shown  in  Fig.  2.  The  main  loop  reads  a  command*  decodes 
it  and  performs  the  required  actions.  All  parts  of  Fig.  2  except  the  action  routines  are 
implemented  as  a  package  of  subroutines  called  Intrac.  These  subroutines  perform  command 
decoding*  file  handling  and  plotting.  Intrac  also  contains  the  macro  facility.  Macros  may 
have  formal  arguments*  local  and  global  variables.  They  permit  conditional  and  repeated 
execution  of  commands  as  well  as  nested  use  of  macros.  There  are  read  and  write  commands* 
which  can  be  used  to  implement  menu  dialogs.  It  is  possible  to  mix  command  mode  and 


rz - 

I  Intrac 


Fia,  2  Skeleton  flaw  chart  for  a  command  driven  program  with  a  macro  facility. 
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question  mode:  since  the  execution  of  a  macro  may  be  suspended  and  resumed  later.  A 
description  of  Intrac  is  given  m  C251  and  1261.  The  commands  available  in  Intrac  are 
listed  in  Appendix  A. 

To  Du i Id  a  package  using  Intrac  it  is  necessary  to  write  the  action  routines  i.e. 
the  Subroutines  that  performs  the  desired  tasks.  The  commands  are  then  entered  in  the 
command  table  of  the  command  decoder.  It  is  also  easy  to  add  a  command  to  a  package:  to 
move  commands  between  packages  and  to  create  special  purpose  packages.  Intrac  may  thus  be 
viewed  as  a  tool  for  converting  a  collection  of  Fortran  subroutines  into  an  interactive 
package.  Intrac  has  also  been  used  to  implement  other  packages  by  other  groups. 

The  structure  with  a  common  user  interface  For  all  packages  is  advantageous  for  the 
user  because  the  interaction  and  the  macro  commands  are  the  same  in  all  packages.  This 
simplifies  learning  and  use  of  the  packages. 

tiSM_£9_£b99f  S-SS-M'dnds? 

The  selection  of  commands  is  one  of  the  major  issues  when  designing  a  CAD  package. 
The  commands  determine  haw  useful  a  package  is  and  how  easy  it  is  to  learn.  It  is 
important  that  commands  are  complete  in  the  sense  that  they  allow  use  of  a  wide  range  of 
techniques  in  an  area.  Otherwise  the  designer  will  only  try  those  approaches  for  which 
commands  are  available.  Commands  should  also  have  a  considerable  exp ressign_ggwe r  so  that 
a  control  system  designer  can  do  what  he  wants  with  a  few  commands.  The  commands  should 
also  reflect  the  natu raf_cgncepts  from  a  theoretical  point  of  view.  This  would  make  it 
easy  for  a  user  well  versed  in  control  theory  to  use  a  package.  The  commands  should  also 

be  few _ and_simpfe  so  that  they  are  easy  to  learn  and  remember.  This  is  of  course  in 

conflict  with  requirements  on  completeness  ancj  expression  power.  Selection  of  commands  is 
thus  a  good  exercise  in  engineering  design. 

Based  on  experiences  from  our  projects  we  have  arrived  at  some  design  principles.  A 
set  of  basic  commands  which  correspond  to  the  elements  of  the  theory  and  which  allow 
coverage  of  a  certain  problem  area  are  first  determined.  S i mpl l f i cat  ions  and  extensions 
are  then  generated  using  the  macro  facility. 


6.  SIMNON 

Simnoh  is  a  package  for  interactive  simulation  of  nonlinear  continuous  time  systems 
with  discrete  time  regulators.  See  C271  and  C2S1  .  The  package  also  includes  noise 
generators:  time-delays:  a  facility  for  using  data  files  from  Idpac  as  inputs  to  the 
system  and  an  optimizer. 

Simnon  allows  a  system  to  be  described  as  an  i nterconnect ion  of  subsystems.  There 
are  two  types  of  subsystems:  continuous  time  systems  and  discrete  time  systems.  This 
makes  Simnon  well  suited  for  simulation  of  digital  control  systems.  The  characteristics 
of  Simnon  are  illustrated  by  an  example. 

Listing  1  gives  a  description  of  a  feedback  loop  consisting  of  a  continuous  time 
process  called  PROC  and  a  digital  PI  regulator  called  RES.  The  process  is  an  integrator 
with  input  saturation.  The  i nte rconnect i ons  are  described  by  the  connecting  system  CON. 

The  following  annotated  dialog  illustrates  how  Simnon  is  used. 

CONTINUOUS  SYSTEM  PROC 

"Integrator  with  input  saturation 

Input  u 

Output  y 

State  x 

Der  dx 

upr=if  u<-0.1  then  -0.1  else  if  u<0.1  then  u  else  0.1 

dx=upr 

END 


DISCRETE  SYSTEM  REG 

"PI  regulator  with  anti-windup 

Input  yr  y 

Output  u 

State  i 

New  ni 

Time  t 

Tsamp  ts 

e=yr— y 

v=k*e+i 

u=if  v<ulow  then  ulow  else  if  vluhigh  then  v  else  uhigh 

ni=i+k*h*e/t i+u-v 

ts=t+h 

kil 

tisl 
h:0.5 
U low  5  —  1 
uhigh: 1 
END 


CONNECTING  SYSTEM  CON 

"Connecting  system  for  simulation  of  process  PROC 

"with  PI  regulation  by  system  REG 

yr[REG3=l 

y  CREGI =y  CPRQC1 

u  CPR0C1 =u  CREG1 

END 


Listing  1  -  Simnon  description  of  a  simple  control  loop  consisting  of  a  continuous  time 
process  and  a  discrete  PI  regulator. 


Command  Action 

SYST  PROC  REG  CON  Activate  the  systems. 

AXES  H  0  100  V  -1  1  Draw  axes. 

PLOT  yr  yCprocl  uCregl  Determine  variables  to  be 

platted . 

STORE  yr  yCpracl  uCreg]  Select  variable  to  be 

stored . 

SI MU  0  100  Simulate. 

SPLIT  2  1  Form  two  screen  windows. 

ASHOW  y  ("Draw  y  with  automatic 

■(scaling  and  yr  with  the 

SHOW  yr  (.same  scales  in  first  window. 

ASHOW  u  Draw  u  with  automatic 

scaling  in  second  window. 

The  result  is  shown  by  the  curves  in  thin  lines  shown  in  Fig.  3.  These  curves  show 
that  there  is  a  considerable  overshoot  due  to  integral  windup.  The  regulator  REG  has 
anti-windup.  The  state  of  the  regulator  is  reset  when  its  output  is  equal  to  ulow  or 
uhigh.  The  limits  were  set  to  ulow=-l  and  uhigh=l  in  the  simulations  shown  with  thin 
lines  in  Fig.  3.  These  values  are  sc  large  that  the  integral  is  never  reset.  The 
simulation!  shows  in  thin  lines  in  Fig.  3t  thus  correspond  to  a  regulator  without 
wind-up.  The  actual  actuator  limitations  correspond  to  ulow=-0.1  and  uhigh=0.1.  The 
commands 

PAR  u 1 ow : -0 . 1 

PAR  uhigh:0.1 

change  the  parameters  and  the  command  SIMU  now  generates  the  curves  shown  in  thick  lines 
in  Fig.  3.  Notice  the  drastic  improvements  due  to  the  nonlinearity  in  the  regulator. 

The  first  version  of  Simnon  was  implemented  in  an  MS  project.  Simnon  has  gone 
through  several  stages  of  development.  A  list  of  the  commands  in  Simnon  is  given  in 
Appendix  6. 


4-12 


Figi_3  Results  of  simulation  of  process  with  PI  regulator.  Thin  lines  show  results  with 
ordinary  regulator  and  thick  lines  show  results  for  regulator  with  anti-windup. 


7.  IDPAC 

Idpac  is  a  package  for  data  analysis  and  i dent i f icat ion  of  linear  systems  having  one 
output  and  many  inputs*  C29]  and  C303  .  Time  series  analysis  of  ARMA  and  ARIMA  models  is  a 
special  case.  The  package  has  commands  for  manipulation  and  plotting  of  data*  correlation 
analysis*  spectral  analysis  and  parametric  system  identification.  There  are  also  commands 
for  model  validation  and  simulation.  The  basic  techniques  used  for  parameter  estimation 
are  the  least  squa res  method  and  the  maximum  likelihood  method.  By  using  the  macro 
facility  it  is*  however*  passible  to  generate  commands  for  most  of  the  parameter 
estimation  methods  which  are  proposed  in  literature.  It  was  actually  in  the  development 
of  Idpac  that  the  power  of  the  macro  concept  became  apparent.  In  the  early  Idpac  versions 
there  were  many  commands  necessary  to  cover  the  available  identifications  methods.  It 
was*  however*  discovered  that  almost  all  methods  could  be  obtained  by  combinations  of 
correlation  analysis*  spectral  analysis*  least  squares  and  maximum  likelihood  estimation. 
Commands  were  thus  constructed  to  give  primitives  for  these  operations  and  the  spectral 
methods  were  then  implemented  as  macros  which  used  the  primitive  commands.  This  approach 
is  also  a  pedagogical  way  to  structure  the  problem  area. 

Idpac  can  be  viewed  as  a  convenient  way  of  packaging  the  research  in  systems 
ident i f icat ion  that  has  been  done  at  our  department  for  a  period  of  20  years.  Idpac  has 
gone  through  several  steps  of  development.  The  relevant  theoretical  basis  is  given  in 
C301  which  also  contains  a  comprehensive  examples  of  using  Idpac.  A  summary  of  the 
commands  are  given  in  Appendix  C.  A  short  description  of  some  features  of  IDPAC  is  given 
below. 

The  program  has  facilities  for  input-output*  editing  and  display  of  data.  It 
includes  several  estimation  procedures  like  correlation  and  spectral  analysis*  least 
squares  and  maximum  likelihood  estimation.  It  has  facilities  for  simulation  and  model 
analysis.  The  program  is  command  driven*  which  means  that  the  user  initiates  the 
different  operations  by  typing  commands  on  a  terminal.  The  program  also  has  a  MACRO 
facility*  which  means  that  a  user  can  combine  several  commands.  In  this  way  it  is 
possible  both  to  have  a  large  flexibility  for  the  experienced  user  and  to  allow  for  a 
simple  use  of  standardized  procedures  for  an  inexperienced  user.  An  example  of  the  use  of 
the  program  is  given  below. 

1.  MOVE  DK  WORK  f-  WORK  C2)  1 

2.  PLOT  WORK 

3.  TREND  (-  WORK  1 

а.  ML  PARI  <-  WORK  1 

5.  ML  PAR2  *-  WORK  2 

б.  ML  PAR3  <-  WORK  3 

The  first  command  simply  moves  the  columns  1  and  3  on  the  data  file  DATA  from  magnetic 
tape  to  a  work  area  on  the  disc.  The  second  command  plots  the  data  on  the  graphical 
display.  The  third  command  removes  a  first  order  trend  from  the  second  column  in  the  file 
WORK.  The  commands  4*  5  and  6  perform  Maximum  Likelihood  estimation  of  the  parameters  in 
the  model  (2.2)  for  orders  1*  2  and  3  using  the  data  in  the  file  WORK.  The  estimated 
parameters  are  stored  in  the  files  PARI*  PAR2  and  PAR3 . 

The  analysis  of  the  models  can  proceed  as  follows. 

7.  RES  ID  RES  «-  PAR2  WRK  20 

This  means  that  the  residuals  of  the  model  with  parameters  PAR2  are  computed  and  stored 
in  the  file  RES.  In  this  computation  the  covariance  function  of  the  residuals  and  the 
cross  covariance  function  between  the  input  and  the  residuals  are  also  computed  and 
automat ica 1 1 y  displayed.  The  commands 


8.  DETER  DET  «-  PAR2  WORK  ( 1) 


computes  the  determi nist ic  output  of  the  model  with  parameters  PAR2  when  the  input  is  the 
process  input  WORK  (1)  and  the  disturbances  neglected.-  The  command 

9.  PLOT  NL  WORK  (2)  DET 

finally  plots  the  process  output  work  (2)  as  separate  points  and  the  output  of  the 
simulated  model. 

Command-driven  programs  like  IDPAC  have  several  advantages.  The  commands  can  be  read 
from  a  file  on  disc  instead  from  the  input  terminal.  By  combining  this  with  the  macro 
facility  it  is  easy  to  obtain  new  commands  simply  by  combining  already  existing  commands. 
In  this  way  it  is  easy  to  generate  commands  for  multistage  least  squaresi  extended  least 
squarest  by  basic  least  squares  command.  IDPAC  may  be  viewed  as  a  special  high  level 
language  for  system  identif ication. 

The  use  of  a  macro  will  be  demonstrated  using  an  example.  Assume  that  a  transfer 
function  model  given  by  Eq .  (3)  has  been  estimated  using  a  parameter  estimation  scheme 
which  also  estimates  the  parameter  uncertainties.  Since  the  transfer  function  parameters 
and  their  uncertainties  do  not  give  much  physical  insight  it  is  useful  to  make  a  Monte 
Carlo  simulation  of  the  responses  of  a  system  whose  parameters  have  a  distribution  with 
the  estimated  means  and  covariances.  This  is  simple  in  principle  byt  tedious  to  program. 
Using  the  MACRO  facility  the  problem  is  solved  as  follows. 

1.  MACRO  MCSIM  Y  t-  MOD  U  NL 

2.  FOR  1  =  1  TO  NL 

3.  RANPA  P  <-  MOD 

4.  DETER  YC1)  <-  P  U 

5.  NEXT  I 

6.  PLOT  Y 

7.  END 

This  macro  generates  the  new  command 

MCSIM  Y  «-  MOD  U  NL 

which  performs  NL  number  of  Monte  Carlo  simulations  of  a  system  MOD  having  uncertain 
parameters.  The  input  signal  is  U  and  the  output  signals  are  stored  as  columns  in  the 
file  Y.  The  first  line  is  simply  the  macro  definition.  Lines  2  and  5  control  the 
iteration.  The  third  line  generates  a  parameter  vector  P  by  sampling  a  gaussian 
distribution  whose  mean  value  is  th^  estimated  parameters  ®  and  whose  covariance  is  the 
estimated  covariances  R.  These  are  stored  in  the  file  MOD.  The  fourth  line  is  a  simple 
simulation  command.  It  generates  the  output  Y  from  a  model  with  parameters  P  having  the 
input  U. 

Having  defined  the  Macro  it  can  now  be  used  as  follows: 

1.  ML  MODEL  <-  DATA  2  SA 

2.  SAVE  COMAT 

3.  LET  NPLX .  =  100 

4.  INSI  U  NPLX 

5.  > PULSE 

6.  MCSIM  Y  4-  MODEL  U  6 

The  first  command  is  an  ML-command  to  generate  a  second  order  ML  model  from  the  measured 
data  stored  in  the  file  DAT.  The  argument  SA  in  the  command  1  means  that  a  special 

command  is  required.  The  second  line  specifies  that  the  covariance  matrix  should  be  saved 

and  stored  in  the  file  called  MODEL.  The  third  command  defines  that  the  variable  NPLX 
should  be  given  the  value  100.  A  signal  called  U  of  length  NPLX  is  defined  in  statement  4 

and  statement  5  specifies  this  signal  to  be  a  unit  pulse.  Command  number  6  finally  calls 

the  macro  command  that  was  just  generated.  The  curves  shown  in  Fig.  4  are  then  displayed 
on  the  graphic  screen. 

The  experiences  with  the  interactive  package  IDPAC  have  been  very  good.  The  program 
has  made  it  possible  to  analyse  results  from  industrial  experiments  quickly  and  at  a 
reasonable  cost.  The  program  has  also  been  a  very  useful  teaching  aid.  It  has  made  it 
possible  to  teach  system  ident i f icat ion  efficiently  in  a  short  time  (about  a  week)  both 
to  students  in  the  university  and  to  engineers  in  industry.  The  program  package  is  now 
being  used  by  a  number  of  industries. 


CONCLUSIONS 


s . 


Computer  aided  design  of  control  systems  is  still  in  its  early  stages.  There  are  a 
number  of  packages  like  those  described  in  this  paper.  An  overview  of  some  packages  are 
round  in  [311*  [32]*  C331 .  [34] ,  [351 *  [36] «  [37] i  [381  and  [391.  More  references  are 
also  found  in  these  papers.  Special  workshops  and  symposia  devoted  to  CAD  for  control 
systems  have  been  organized  by  IFAC*  GE-RPI*  and  IEEE  CSS.  See  [40]*  [411 t  [42]  and  [431. 
Computer  aided  tools  are  also  popular  in  many  other  fields  e.g.  mechanical  design  and 
VLSI  design.  The  seminal  work  on  computer  graphics  [221  contain  much  material  and  many 
rererences. 


Fig.  4  Monte  Carlo  simulation  of  a  model  of  ship  dynamics.  The  curves  show  how  the 
uncertainties  in  the  parameters  of  a  transfer  function  model  are  reflected  in 
uncertainties  in  the  impulse  response. 


The  field  is  in  a  state  of  rapid  development  due  to  an  increased  understanding  of 
the  technology  and  the  drastic  development  of  computer  and  graphics  hardware.  It  is  safe 
to  predict  that  future  computer  aided  design  tools  will  be  much  more  powerful  than  the 
packages  described  in  this  paper.  Some  speculations  on  future  development  are  given  in 
caai . 


Interactive  computing  is  a  powerful  tool  for  problem  solving.  An  engineer  can  come 
to  the  work  station  with  a  problem  and  he  can  leave  with  a  complete  solution  after  a  few 
hours.  The  results  are  well  documented  in  terms  of  listings*  text  and  graphs.  The  problem 
solver  can  obtain  the  solution  by  himself  without  relying  on  programmers  as 
intermediaries.  Our  projects  have  shown  that  the  productivity  in  analysing  and  designing 
control  systems  can  be  increased  substant ia 1 ly  by  using  these  tools,  kle  believe  that 
interactive  computer  aided  design  tools  is  one  possibility  to  make  modern  control  theory 
cost  effective. 

Computer  aided  design  of  control  systems  is  still  in  its  infancy.  A  small  number  of 
systems  have  been  implemented  in  a  few  places.  There  are  many  possible  future 
developments  which  are  mainly  driven  by  the  computer  development.  Packages  of  the  type  we 
have  been  experimenting  with  can  easily  be  fitted  into  the  personal  computers  or  work 
stations  that  will  be  available  in  a  few  years  time.  The  bit  mapped  high  resolution  color 
displays  that  will  be  available  on  these  computers  offer  new  possibilities  for  an 
efficient  man-machine  dialog.  With  the  drastic  increase  in  computer  capacity*  that  is 
forth  coming*  it  is  also  passible  to  make  much  more  ambitious  projects.  Applications  of 
computer  aided  design  also  appear  in  many  other  branches  of  engineering.  Cross 
fertilization  between  the  fields  will  most  likely  lead  to  a  rapid  development. 

The  procedure  for  obtaining  control  law  consists  of  the  steps!  experimental  design* 
experiments*  parameter  estimation*  control  design  and  implementation.  It  may  be  a 
considerable  effort  to  go  through  these  steps.  As  an  alternative  we  may  consider  adaptive 
control  which  may  be  viewed  as  an  automation  of  the  procedure. 
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APPENDIX  A  -  Intrac  commands 


1 .  I nput 
READ 
SWITCH 
WRITE 


and  output 

-  Read  string  or  variable  from  keyboard 

-  Utility  command 

-  Write  string  or  variable  on  terminal 


2.  Assignm 

DEFAULT  - 

FREE 

LET 

STOP 


ent 

Assign  default  values 
Release  assigned  global  variables 
Assignment  of  variables  and  global 
Stop  execution  and  return  to  OS 


pa  rameters 


3.  Control 
FOR.. TO  - 
NEXT  V 
LABEL  L  - 
GOTO  L 
IF. .GOTO  - 


of  program  flow 
Loop 

Declaration  of  label 
Transfer  control 
Transfer  control 


A.  Macro 

END  -  End  of  macro  definition 

FORMAL  -  Declaration  of  formal  arguments 

MACRO  -  Macro  definition 

RESUME  -  Resume  execution  of  macro 

£' 'SPEND  -  Suspend  execution  of  macro 

APPENDIX  B  -  Simnon  commands 

1.  Utilities 

EDIT  -  Edit  system  description 

GET  -  Get  parameters  and  initial  values 

LIST  -  List  files 

PRINT  -  Print  files 

SAVE  -  Save  parameter  values  and  initial  values  in  a  file 
STOP  -  Stop 

2.  Graphic  output 

AREA  -  Select  window  on  screen 

ASHOW  -  Plot  stored  variables  with  automatic  scaling 

AXES  -  Draw  axes 

HCOPY  -  Make  hard  copy 

SHOW  -  Plot  stored  variables 

SPLIT  -  Split  screen  into  windows 

TEXT  -  Transfer  text  string  to  graph 

3.  Simulation  Commands 

ALGOR  -  Select  integration  algorithm 

DISP  -  Display  parameters 

ERROR  -  Choose  error  bound  for  integration  routine 

INIT  -  Change  initial  values  of  state  variables 

PAR  -  Change  parameters 

PLOT  -  Choose  variables  to  be  plotted 

SIMU  -  Simulate  a  system 

STORE  -  Choose  variables  to  be  stored 

SYST  -  Activate  systems 

APPENDIX  C  -  Idpac  Commands 

1.  Utilities 

CONV  -  Conversion  of  data  to  internal  standard  format 

DELET  -  Delete  a  file 

EDIT  -  Edit  system  description 

FHEAD  -  Inspect  and  change  file  parameters 

FORMAT  -  Conversion  of  data  to  symbolic  external  form 

FTEST  -  Check  existence  of  a  file 

LIST  -  List  files 

MOVE  -  Move  data  in  database 

TURN  -  Change  program  switches 

2.  Graphic  output 

BODE  -  Plot  Bode  diagrams 
HCOPY  -  Make  hard  copy 

PLMAG  -  Magnify  plot  and  allow  changes  of  data 
PLOT  -  Plot  curves  with  linear  scales 

3.  Time  series  operations 

ACOF  -  Compute  autocorrelation  function 

CCOF  -  Compute  cross-correlation  function 

CONC  -  Concatenate  time  series 

CUT  -  Extract  a  part  of  a  time  series 

INSI  -  Generate  time  series 

PICK  -  Pick  equidistant  time  points 

SCLOP  -  Do  scalar  operations  on  a  time  series 

SLIDE  -  Introduce  relative  delays  between  time  series 

STAT  -  Compute 

TREND  -  Remove  a  trend 

VECOP  —  Do  vector  operations  on  a  time  series 

4.  Frequency  response  operations 
ASPEC  -  Compute  an  auto  spectrum 
CSPEC  -  Compute  a  cross  spectrum 
DFT  -  Discrete  Fourier  Transform 
FROP  -  Operate  on  frequency  responses 
IDFT  -  Inverse  Discrete  Fourier  Transform 
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5.  Simulation  and  model  analysis 
DETER  -  Deterministic  Simulation 
DSIM  -  Simulation  with  noise 
FILT  -  Compute  a  filter  system 

RANPA  -  Pick  parameters  from  a  random  distribution 

RESID  -  Compute  residuals  with  statistical  tests 

SPTRF  -  Compute  the  frequency  response  of  a  transfer  function 

6.  Identification 

LS  -  Least  Squares  identif ication 
ML  -  Maximum  Likelihood  identification 
SQR  -  Least  Squares  data  reduction 
STRUC  -  Least  Squares  structure  definition 
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NUMERICAL  ASPECTS  OF  CONTROL  DESIGN  COMPUTATIONS 
Professor  Alan  J.  Laub 

Department  of  Electrical  Engineering  -  Systems 
University  of  Southern  California 
Los  Angeles,  CA  90089 


SUMMARY 

The  interplay  between  recent  results  and  methodologies  in  numerical  linear  algebra 
and  mathematical  software  and  their  application  to  problems  arising  in  systems,  control, 
and  estimation  theory  is  discussed.  The  impact  of  finite  precision,  finite  range  arith¬ 
metic  (including  the  implications  of  the  proposed  IEEE  Floating  Point  Standard(s))  on 
control  design  computations  is  illustrated  with  numerous  examples  as  are  pertinent  remarks 
concerning  numerical  stability  and  conditioning.  Basic  tools  from  numerical  linear  algebra 
such  as  linear  equations,  linear  least  squares,  eigenproblems ,  generalized  eigenproblems , 
and  singular  value  decomposition  are  then  outlined.  A  selected  list  of  applications  of 
the  basic  tools  then  follows  including  algorithms  for  solution  of  problems  such  as  matrix 
exponentials,  frequency  response,  system  balancing,  and  matrix  Riccati  equations.  The 
implementation  of  such  algorithms  as  robust  mathematical  software  is  then  discussed.  A 
number  of  issues  are  addressed  including  characteristics  of  reliable  mathematical  software, 
availability  and  evaluation,  language  implications  (Fortran,  Ada,  etc.),  and  the  overall 
role  of  mathematical  software  as  a  component  of  computer-aided  control  system  design. 

1.  INTRODUCTION 

This  paper  provides  an  introduction  to  various  aspects  of  the  numerical  solution  of 
selected  problems  of  interest  in  systems,  control,  and  estimation  theory.  Space  limitations 
preclude  an  exhaustive  survey;  rather,  a  compact  "introduction  to  the  literature"  will 
lead  the  interested  reader  to  sources  of  additional,  detailed  information. 

Many  of  the  problems  considered  here  arise  in  the  study  of  the  "standard"  linear 

model 


ic  (t)  =  AX  (t) 

+  Bu (t) 

(1) 

y (t)  =  Cx ( t) 

+  Du (t)  . 

(2) 

Here,  x(t)  is  an  n-vector  of  states,  u(t)  is  an  m-vector  of  controls  or  inputs,  and  y(t) 
is  an  r-vector  of  outputs.  The  standard  discrete-time  analogue  of  (1),  (2)  takes  the  form 

xk+l  =  Axk  +  Buk  (3) 

*k  =  Cxk  +  Dv  (4) 

Of  course,  considerably  more  elaborate  models  are  also  studied,  including  time-varying, 
stochastic,  and  nonlinear  versions  of  the  above.  In  fact, the  above  linear  models  are 
usually  derived  as  linearizations  of  nonlinear  models  about  selected  nominal  points.  The 
interested  reader  is  referred  to  standard  textbooks  such  as  [ 1 ] — [ 4 ]  for  further  details. 


The  matrices  considered  here  will,  for  the  most  part,  be  assumed  to  have  real  co¬ 
efficients  and  be  small  (of  order  a  few  hundred  or  less)  and  dense  with  no  particular  ex¬ 
ploitable  structure.  Calculations  for  most  problems  in  classical  single-input,  single¬ 
output  control  fall  into  this  category.  It  must  be  emphasized  that  consideration  of  large, 
sparse  matrices  or  matrices  with  special,  exploitable  structure  may  involve  significantly 
different  concerns  and  methodologies  than  those  to  be  discussed  here. 

The  systems,  control,  and  estimation  literature  is  replete  with  ad  hoc  algorithms  to 
solve  the  computational  problems  which  arise  in  the  various  methodologies.  Many  of  these 
algorithms  work  quite  well  on  some  problems  (e.g.,  "small  order"  matrices)  but  encounter 
numerical  difficulties,  often  severe,  when  "pushed"  (e.g.,  on  larger  order  matrices). 

The  reason  for  this  is  that  little  or  no  attention  has  been  paid  to  how  the  algorithms 
will  perform  in  "finite  arithmetic",  i.e.,  on  a  finite-word-length  digital  computer. 


A  simple  example  due  to  Moler  and  Van  Loan  (5J  will  illustrate  a  typical  pitfall. 
Suppose  it  is  desired  to  compute  the  matrix  eA  in  single  precision  arithmetic  on  an  IBM  370 
computer.  In  this  particular  computing  environment  we  have,  roughly  speaking,  about  6 
decimal  places  of  precision  in  the  fraction  part  of  floating  point  numbers.  Consider  the 


A  =  [Ig4  j4 j  and  suppose  the  computation  is  attempted  using  the  formula 

V~  '  o A  .  V  I 


=  1  —  A 

L  k  1 
k=0  K1 


This  is  easily  coded  and  it  is  determined  that  the  first  60  terms  in  the  series  suffice 
for  the  computation,  in  the  sense  that  terms  for  k  a  60  are  0(10-7)  and  no  longer  add  any 
thing  significant  to  the  sum.  The  resulting  answer  is 


-22.2588 

-61.4993 


-1.43277 

-3.47428 
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Unfortunately,  the  true  answer  is  (correctly  rounded) 

/- 0.735759  0.551819^ 

1  -1.47152  1.  10364  ) 

and  one  sees  a  rather  alarming  disparity.  What  happened  here  was  that  the  intermediate 
terms  in  the  series  got  very  large  before  the  factorial  began  to  dominate.  In  fact,  the 
17th  and  18th  terms,  for  example,  are  of  the  order  of  107  but  of  opposite  signs  so  that 
the  less  significant  parts  of  these  numbers  -  while  significant  for  the  final  answer  - 
are  "lost"  because  of  the  finiteness  of  the  arithmetic. 

Now  for  this  particular  example  various  fixes  and  remedies  are  available.  But  in  more 
realistic  examples  one  seldom  has  the  luxury  of  having  the  "true  answer"  available  so  that 
is  is  not  always  easy  to  simply  inspect  or  test  an  answer  such  as  the  one  obtained  above 
and  determine  it  to  be  in  error.  Mathematical  analysis  (truncation  of  the  series,  in  the 
example  above)  alone  is  simply  not  sufficient  when  a  problem  is  analyzed  or  solved  in 
finite  arithmetic  (truncation  of  the  arithmetic) .  Clearly,  a  great  deal  of  care  must  be 
taken. 

The  finiteness  inherent  in  representing  real  or  complex  numbers  as  floating-point 
numbers  on  a  digital  computer  manifests  itself  in  two  important  ways:  floating  point  numbers 
have  only  finite  precision  and  finite  range.  In  fact,  it  is  the  degree  of  attention  paid 
to  these  two  considerations  that  distinguishes  many  reliable  algorithms  from  more  unreli¬ 
able  counterparts.  Reference  [6]  still  provides  the  def initive  introduction  to  the  vagaries 
of  floating-point  computation  while  (7]  and  the  references  therein  may  be  consulted  to 
bring  the  interested  reader  up  to  date  on  roundoff  analysis. 

An  even  more  recent  development  in  floating-point  arithmetic  has  been  the  work  of  the 
Floating-Point  Working  Group  of  the  Microprocessor  Standards  Subcommittee  of  the  IEEE 
Computer  Society  Standards  Committee.  An  early  draft  (Draft  8.0)  of  the  IEEE  Standard  754 
for  Binary  Floating-Point  Arithmetic  appears  in  [8]  along  with  several  related  articles. 
Draft  10.0  of  P754  was  sent  in  December  1982  to  the  appropriate  sponsoring  committee  for 
balloting  at  the  next  level  (a  successful  vote  would  send  the  draft  to  the  IEEE  standards 
board) .  A  parallel  effort  (Task  854)  is  also  under  way  to  draft  a  Radix-Free  Standard. 

These  standards  define  families  of  commercially  feasible  ways  for  new  systems  (originally 
microprocessor  but  also  now  minicomputers  and  large  mainframes)  to  perform  binary  floating¬ 
point  arithmetic  in  a  numerically  "sensible"  way.  The  adoption  of  these  Standards  will 
have  a  major  impact  on  algorithmic  development  and  software.  In  fact,  early  versions  of 
the  Binary  Standard  are  already  available  for  certain  microprocessor  systems  (e.g.,  the 
Intel  8087)  which  will  come  into  ever-increasing  use  in  control  and  estimation  in  the  1980 's 
and  1990 's. 

The  development,  in  systems,  control,  and  estimation  theory,  of  stable,  efficient, 
and  reliable  algorithms  which  respect  the  constraints  of  finite  arithmetic  is  only  now  in 
its  infancy.  Much  of  current  research  in  numerical  analysis  is  directly  applicable  but 
there  are  many  computational  issues  in  control  (e.g.,  the  presence  of  hard  or  structural 
zeros)  where  numerical  analysis  does  not  yet  provide  a  ready  answer  or  guide.  A  symbiotic 
relationship  has  already  developed  which  is  sure  to  provide  a  continuing  source  of  challen¬ 
ging  research  areas. 

The  abundance  of  numerically  fragile  algorithms  is  partly  explained  by  the  following 
observation  which  will  be  emphasized  by  calling  it  a  "folk  theorem": 

If  an  algorithm  is  amenable  to  "easy"  hand  calculation,  it's  probably 
a  poor  method  if  implemented  in  the  finite  floating-point  arithmetic 
of  a  digital  computer. 

For  example,  when  confronted  with  finding  the  eigenvalues  of  a  2x2  matrix  most  people 
would  find  the  characteristic  polynomial  and  solve  the  resulting  quadratic  equation.  But 
when  extrapolated  as  a  general  method  and  implemented  on  a  digital  computer  this  turns  out 
to  be  a  very  poor  procedure  indeed  for  a  variety  of  reasons  (such  as  roundoff  and  overflow/ 
underflow) .  Of  course  the  preferred  method  now  would  generally  be  the  double  Francis  QR 
algorithm  (see  19],  [10]  for  the  messy  details)  but  few  of  us  would  attempt  that  by  hand  — 
even  for  very  small  order  problems. 

In  fact,  it  turns  out  that  many  algorithms  which  are  now  considered  fairly  reliable 
in  the  context  of  finite  arithmetic  are  not  amenable  to  hand  calculation  (e.g.,  various 
classes  of  orthogonal  similarities) .  This  is  sort  of  a  converse  to  the  folk  theorem. 
Particularly  in  linear  control  and  systems  theories,  we  have  been  too  easily  seduced  by 
the  ready  availability  of  closed-form  solutions  and  numerically  naive  methods  to  implement 
those  solutions.  For  example,  in  solving  the  initial  value  problem 

fc(t)  =  Ax ( t )  ;  x(0)  =  xQ  (6) 

it  is  not  at  all  clear  that  one  should  want  to  compute  the  intermediate  quantity  etA. 

Rather,  it  is  the  vector  e^xg  that  is  desired,  a  quantity  that  may  be  computed  more  rea¬ 
sonably  by  treating  (6)  as  a  system  of  (stiff)  differential  equations  and  using,  say,  an 
implicit  method  for  numerical  integration  of  the  differential  equation.  But  such  techniques 
are  definitely  not  attractive  for  hand  computation. 

Remedying  the  present  situation  is  largely  a  matter  of  awareness  and  education.  While 
it  is  a  slow  process,  we  are  now  just  beginning  to  see  some  of  the  background  material 
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(well-known  to  numerical  analysts)  mentioned  in  this  paper  filter  down  to  the  undergraduate 
and  graduate  curriculum  in  mathematics  and  engineering.  Introductory  textbooks  such  as  fill- 
113]  are  now  also  ref lecting  a  strong  software  component.  This  process  is  certain  to  have  a 
significant  impact  on  the  future  directions  and  development  of  control  and  systems  theory 
and  applications  as  witness  the  growth  of  CACSD  as  an  intrinsic  tool.  Algorithms  imple¬ 
mented  as  mathematical  software  are  a  critical  "inner"  component  of  a  CACSD  system  and  the 
remainder  of  this  paper  will  address  some  of  the  issues  involved. 

Before  proceeding  further  we  shall  list  here  some  notation  to  be  used  in  the  sequel. 

Fnxm  the  set  of  all  nxm  matrices  with  coefficients  in  the  field  T  (Twill 
generally  be  Sor  <E) 

jpnxm  the  set  of  all  nxm  matrices  of  rank  r  with  coefficients  in  the  field  F 

At  the  transpose  of  A  e  IRnX11' 

A  the  complex-conjugate  transpose  of  A  e  <Enxm 

A+  the  Moore-Penrose  pseudoinverse  of  A 

I|A[j  the  spectral  norm  of  A  (i.e.,  the  matrix  norm  subordinate  to  the 
Euclidean  vector  norm:  [I a!I  =  max  !|ax!|_) 

!!x||2=i  12 

diag (a^, . . . , an>  the  diagonal  matrix  /a^  0\ 


A (A)  the  set  of  eigenvalues  Aj.,...,An  (not  necessarily  distinct)  of  A  e  Fnxn 
(A)  the  ith  eigenvalue  of  A 

E  (A)  the  set  of  singular  values  a^,...,om  (not  necessarily  distinct) of  A  e  Fnxm 
ai(A)  the  ith  singular  value  of  A. 

Finally,  let  us  define  a  particular  number  to  which  we  shall  make  frequent  reference 
in  the  sequel.  The  machine  epsilon  or  machine  precision  can  be  defined,  roughly  speaking, 
as  the  smallest  positive  number  €  which,  when  added  to  1  on  our  computing  machine,  gives 
a  number  greater  than  1.  In  other  words,  any  machine  representable  number  5  less  than  e 
gets  "rounded  off"  when  (floating-point)  added  to  1  to  give  exactly  1  again  as  the  rounded 
sum.  The  number  e  varies,  of  course,  depending  on  the  kind  of  computer  being  used  and  the 
precision  with  which  the  computations  are  being  done  (single  precision,  double  precision, 
etc.).  But  the  fact  that  there  exists  such  a  positive  number  e  is  entirely  a  consequence 
of  finite  word  length. 

2.  NUMERICAL  STABILITY  AND  CONDITIONING 

In  this  section  we  give  a  very  brief  discussion  of  two  concepts  of  fundamental  impor¬ 
tance  in  numerical  analysis:  numerical  stability  and  conditioning.  While  this  material 
is  standard  in  introductory  textbooks  such  as  (14]- [17]  it  is  presented  here  both  for  com¬ 
pleteness  and  because  the  two  concepts  are  frequently  confused  in  the  systems/control/ 
estimation  literature. 

Suppose  we  have  some  mathematically  defined  problem  represented  by  f  which  acts  on 
data  del)  =  some  set  of  data,  to  produce  a  solution  f  (d)  e^  =  some  set  of  solutions.  These 
notions  are  kept  deliberately  vague  for  expository  purposes.  Given  d e Z  we  desire  to  com¬ 
pute  f  (d) .  Suppose  d*  is  some  approximation  to  d.  If  f(d*)  is  "near"  f (d)  the  problem 
is  said  to  be  well-conditioned.  If  f ( d* )  may  potentially  differ  greatly  from  f (d)  even 
when  d*  is  near  d,  the  problem  is  said  to  be  ill-conditioned.  Again,  the  concept  "near” 
cannot  be  made  precise  without  further  information  about  a  particular  problem. 

A  simple  example  of  an  ill-conditioned  problem  is  the  following.  Consider  the  n  x  n 
matrix  , 

/  0  1  0  ...  0  \ 


with  n  eigenvalues  at  0.  Now  consider  a  small  perturbation  of  the  data  (the  elements 
of  A)  consisting  of  adding  the  number  2-n  to  the  first  element  in  the  last  (ntfll  ?ow  of  A. 
This  perturbed  matrix  then  has  n  distinct  eigenvalues  Aj_,...,\„  with  =  ^exp  ( SJ^jLl)  .  Thus 
we  see  that  this  small  perturbation  in  the  data  has  been  magnified  by  a  factor  on  the  order 
of  2n  to  result  in  a  rather  large  perturbation  in  the  solution  (the  eigenvalues  of  A) . 
Further  details  and  related  examples  are  to  be  found  in  (9]. 

Note  that  we  have  so  far  made  no  mention  of  how  the  problem  f  above  (computing  (A) 
in  the  example)  was  to  be  solved.  Conditioning  was  a  function  solely  of  the  problem 
itself.  To  solve  a  problem  numerically  we  typically  must  implement  some  numerical  pro- 
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cedure  or  algorithm  which  we  shall  denote  by  f * .  Thus,  given  d,  f* (d)  represents  the  re¬ 
sult  of  applying  the  algorithm  to  d  (for  simplicity,  we  assume  d  is  "representable";  a  more 
general  definition  can  be  given  when  some  approximation  d**  to  d  is  used).  The  algorithm 
f*  is  said  to  be  numerically  stable  if,  for  all  d  e  i,  there  exists  d*  ;  near  d  such  that  f*  (d) 
is  near  f  (d* )  (=  the  exact  solution  of  a  nearby  problem)  .  I  f  the  problem  is  well-conditioned  then 
f(d*)  will  be  near  f  ( d)  so  that  f*  (d)  will  be  near  f  (d)  .  In  other  words,  f  *  does  not  int  rod  ucc 
any  more  sensitivity  to  perturbation  than  is  inherent  in  the  problem.  Example  1.  below 
will  further  illuminate  this  definition  of  stability  which,  on  a  first  reading,  can  seem 
somewhat  confusing. 

Of  course,  one  can't  expect  a  stable  algorithm  to  solve  an  ill-conditioned  problem 
any  more  accurately  than  the  data  warrant  but  an  unstable  algorithm  can  produce  poor  solu¬ 
tions  even  to  well-conditioned  problems.  Example  2.  below  will  illustrate  this  phenomenon. 
There  are  thus  two  separate  factors  to  consider  in  determining  the  accuracy  of  a  computed 
solution  f* (d) .  First,  if  the  algorithm  is  stable,  f* (d)  is  near  f(d*)  and  second,  if  the 
problem  is  well-conditioned  then,  as  above,  f(d»)  is  near  f  (d) .  Thus  f* (d)  is  near  f(d) 
and  we  have  an  "accurate"  solution. 

Roundoff  errors  can  cause  unstable  algorithms  to  give  disastrous  results.  However, 
it  would  be  virtually  impossible  to  account  for  every  roundoff  error  made  at  every  arith- 
metric  operation  in  a  complex  series  of  calculations  such  as  those  involved  in  most  linear 
algebra  calculations.  This  would  constitute  a  forward  error  analysis.  As  a  more  practical 
alternative,  J.H.  Wilkinson  and  others  have  advanced  the  notion  of  baakuard  error  analysis 
to  account  for  roundoff  error.  Specifically,  for  many  problems  (particularly  in  numerical 
linear  algebra) ,  it  is  possible  to  show  that  what  is  actually  computed  is  near  the  exact  solu¬ 
tion  of  a  nearby  problem.  One  then  attempts  to  show  that  the  nearby  problem  is  near  enough 
which,  if  the  problem  is  well-conditioned,  can  be  translated  into  a  quantitative  statement 
regarding  the  accuracy  of  the  solution.  Examples  of  this  will  be  quoted  in  later  sections. 

We  close  this  section  with  two  simple  examples  to  illustrate  some  of  the  concepts  in¬ 
troduced  above. 


Example  1:  Let  x  and  y  be  two  floating-point  numbers  and  let  fl(x*y)  denote  the  result  of 
multiplying  them  in  floating-point  arithmetic.  In  general,  the  product  x*y  will  require 
more  precision  to  be  represented  exactly  then  was  necessary  to  represent  x  or  y.  But  what 
can  be  shown  for  most  realistic  models  of  floating-point  computation  is  that 

f £ (x*y)  =  x*y  (1+6)  (7) 

where  | 5 |  <  €.  In  other  words,  f£(x*y)  is  x*y  correct  to  within  a  unit  in  the  last  place. 
Now,  another  way  to  write  (7)  is  as 

fMx*y)  =  xtl+6)*5*  yU+6)*5  (8) 

where  | 6 |  <  €.  This  can  be  interpreted  as  follows:  the  computed  result  f£(x*y)  is  the 
exact  product  of  the  two  slightly  perturbed  numbers  x(l+6)*5  and  y(l+6)*5.  Note  that  the 
slightly  perturbed  data  (not  unique)  may  not  even  be  representable  floating-point  numbers. 
The  representation  (8)  is  simply  a  way  of  accounting  for  the  roundoff  incurred  in  the  al¬ 
gorithm  by  an  initial  (small)  perturbation  in  the  data. 

Example  2:  Gaussian  elimination  with  no  pivoting  for  solving  the  linear  system 

Ax  =  b  (9) 


is  known  to  be  numerically  unstable.  The  following  data  will  illustrate  this  phenomenon. 


Let  A 


_  /0.0001  0.2345)  ,  _  /o .  2  346 )  ...  .  . 

10.6789  O.lOOOy  '  b  \ 0 . 7789/  ‘  A11  comPutat 


ions  will  be  carried  out  in 


4-decimal  place  arithmetic.  The  "true  answer"  x  =  A-^b  is  easily  seen  to  be  [.]  .  Using 
row  1  as  the  "pivot  row"  (i.e.,  subtracting  6789  x  row  1  from  row  2)  we  arrive  at  the  equiv¬ 
alent  triangular  system 

(0.0001  0.2345' 

0  -1592. 

From  the  equation 

-1592x2  =  -1591 

we  find  x2  =  0.9994  and  from  the  equation 

O.OOOlxx  =  0.2346  -  fUO.  2345  *  0.9994) 

=  0.0002 


we  find  x^  =  2.  This  extremely  bad  approximation  to  x^  is  the  result  of  numerical  insta¬ 
bility.  The  problem  itself  can  be  shown  to  be  reasonably  well-conditioned. 


3.  FUNDAMENTAL  PROBLEMS  IN  NUMERICAL  LINEAR  ALGEBRA 


In  this  section  we  give  a  brief  overview  of  some  of  the  fundamental  problems  in  nu¬ 
merical  linear  algebra  which  serve  as  building  blocks  or  "tools"  for  the  solution  of  pro- 


blems  in  systems,  control,  and  estimation. 

3.1  Linear  Algebraic  Equations  and  Linear  Least  Squares  Problems 
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Probably  the  most  fundamental  problem  in  numerical  computing  is  the  calculation  of  a 
vector  x  which  satisfies  the  linear  system 

Ax  =  b  (10) 

where  Ac  ®|?xn  (or  <Tnxn)  .  A  great  deal  is  now  known  about  solving  (10)  in  finite  arith¬ 
metic  both  tor  the  general  case  and  for  a  large  number  of  special  situations.  Some  of  the 
standard  references  include  [14],  [18]- [20]. 

The  most  commonly  used  algorithm  for  solving  (10)  with  general  A  and  small  n  (say  n  s 
200)  is  Gaussian  elimination  with  some  sort  of  pivoting  strategy,  usually  "partial  pivo¬ 
ting."  This  essentially  amounts  to  factoring  some  permutation  of  the  rows  of  A  into  the 
product  of  a  unit  lower  triangular  matrix  L  and  an  upper  triangular  matrix  U.  The  algor¬ 
ithm  is  effectively  stable,  i.e.,  it  can  be  proved  that  the  computed  solution  is  near  the 
exact  solution  of  the  system 

(A  +  E)  x  =  b  (11) 

with  ]  e  j  |  s  <(>  (n)  •  y  •  8  •  e  where  $  (n)  is  a  modest  function  of  n  depending  on  details  of  the 
arithmetic  used,  y  is  a  "growth  factor"  (which  is  a  function  of  the  pivoting  strategy  and 
is  usually  -  but  not  always  -  small),  8  behaves  essentially  like  ||a||  ,  and  e  is  the  machine 
precision.  See  [14]  for  further  details.  In  other  words,  except  for  moderately  patho¬ 
logical  situations,  E  is  "small"  -  on  the  order  of  the  machine  precision. 

The  following  question  then  arises.  If,  because  of  roundoff  errors,  we  are  effectively 
solving  (11)  rather  than  (10),  what  is  the  relationship  between  (A+E)-^b  and  A-1b?  To 
answer  this  question  we  need  some  elementary  perturbation  theory  and  this  is  where  the 
notion  of  condition  number  arises.  A  condition  number  for  the  problem  (10)  is  given  by 

< (a)  :=  ('|a!|  Ha'1!!.  (12) 

Simple  perturbation  results  can  be  used  to  show  that  perturbations  in  A  and/or  b  can  be 
magnified  by  as  much  as  k (A)  in  the  computed  solution.  Estimation  of  k (A)  (since,  of 
course.  A-1  is  unknown)  is  thus  a  crucial  aspect  of  assessing  solutions  of  (10)  and  the 
particular  estimation  procedure  used  is  usually  the  principal  difference  between  competing 
linear  equation  software  packages.  One  of  the  more  sophisticated  and  reliable  condition 
estimators  presently  available  is  based  on  [21]  and  is  implemented  in  LINPACK  [19].  In 
addition  to  the  Ig  condition  estimator  of  [21],  LINPACK  features  many  codes  for  solving 
(10)  in  case  A  has  certain  special  structures. 

Another  important  class  of  linear  algebra  problems  and  one  for  which  codes  are  avail¬ 
able  in  LINPACK  is  the  linear  least  squares  problem: 

min||Ax-b!|2  (13) 

where  Ac  ]R^xn  with  (in  the  simplest  case)  k = n  s m.  The  solution  of  (13)  can  be  written 
formally  as  x  =  A+b.  Here,  standard  references  include  [14],  [19],  [22].  The  method  of 
choice  is  generally  based  upon  the  QR  factorization  of  A:  (for  simplicity,  Ac  ]R^xn) 

A  =  QR  (14) 

where  Rc  3R^xn  is  upper  triangular  and  Qs  ]Rmxn  has  orthonormal  columns,  i.e.,  QTQ  =  I. 

With  special  care  and  analysis  the  case  k  <  n  can  also  be  handled  similarly.  The  factori¬ 
zation  is  effected  through  a  sequence  of  Householder  transformations  H;  applied  to  A. 

T 

Each  H^  is  symmetric  and  orthogonal  and  of  the  form  I  -  -  y~—  where  u  c  IRm  is  specially 

u  u 

chosen  to  introduce  zeros  at  appropriate  places  when  H^  premultiplies  A.  After  n  such 
transformations  we  have 


H  H  .  .  .  .H..A  =  (n) 
n  n-1  1  V  0 / 


from  which  the  factorization  (14)  follows.  Defining  c  and  d  by 


H  H  ...  . H.b  =:  n) 
n  n-1  1  \d/ 


where  ce  ]Rn,  it  is  easily  show.:  that  the  least  squares  solution  x  of  (13)  is  given  by  the 
solution  of  the  linear  system 


The  above  algorithm  can  be  shown  to  be  numerically  stable  and,  again,  a  well-developed 
perturbation  theory  exists  from  which  condition  numbers  can  be  obtained ,  this  time  in  terms 
of 

*(a)  :=  !|a;i  :|a+:i. 

Least  squares  perturbation  theory  is  fairly  straightforward  in  case  A  e  ®n  but  is 


considerably  more  complicated  when  A  is  rank-deficient.  The  reason  for  this  is  that  while 
the  inverse  is  a  continuous  function  of  the  data,  the  pseudoinverse  is  discontinuous.  For 

example,  consider  A  =  =  A+  and  perturbations  E1  =  ^  with 

6  small.  Then  (A+E.)  +  =  f  l+<$^  1+6*"  j  which  is  close  to  A+  but  (A  +  E.)+  = 

V  „  J 

which  gets  arbitrarily  far  from  A+  as  6  is  decreased  towards  0.  For  a  complete  survey  of 
perturbation  theory  for  the  least  squares  problem  and  related  questions  see  [23], 

In  lieu  of  Householder  transformations,  Givens  transformations  may  also  be  used  to 
solve  the  linear  least  squares  problem.  Details  can  be  found  in  [9],  [19],  [22],  [24], 

[25].  Recently,  Givens  transformations  have  received  considerable  attention  for  the 
solution  of  both  linear  least  squares  problems  as  well  as  systems  of  linear  equations  in 
a  parallel  computing  environment.  The  capability  of  introducing  zero  elements  selectively 
and  the  need  for  only  local  interprocessor  communication  make  the  technique  ideal  for 
"parallelization."  Indeed,  there  have  been  literally  dozens  of  "parallel  Givens"  algorithms 
proposed  and  we  include  [26]- [30]  as  representative  references. 


In  the  algebraic  eigenvalue/eigenvector  problem  for  A  e  ]Rnxn  one  seeks  nonzero  solutions 
xs£n  and  X  c  C  which  satisfy 

Ax  =  >.  x.  (16) 

The  classic  reference  on  the  numerical  aspects  of  this  problem  is  Wilkinson  [9]  with  Parlett 
[25]  providing  an  equally  thorough  and  up-to-date  treatment  of  the  case  of  symmetric  A  (in 
which  xe  ]Rn  ,  J  e  B)  ,  A  more  brief  textbook  introduction  is  given  in  [14]. 

It  is  really  only  rather  recently  that  some  of  the  computational  issues  associated  with 
solving  (16)  -  in  the  presence  of  rounding  error  -  have  been  resolved  or  even  understood. 
Even  now  some  problems  such  as  the  invariant  subspace  problem  continue  to  be  active  research 
areas.  For  an  introduction  to  some  of  the  difficulties  which  may  be  encountered  in  trying 
to  make  numerical  sense  out  of  mathematical  constructions  such  as  the  Jordan  canonical  form 
the  reader  is  urged  to  consult  [31]. 

The  most  common  algorithm  now  used  to  solve  (16)  for  general  A  is  the  QR  algorithm  of 
Francis  [32].  A  shifting  procedure  is  used  to  enhance  convergence  and  the  usual  implemen¬ 
tation  is  called  the  double-Francis-QR  algorithm.  Before  the  QR  process  is  applied,  A  is 
initially  reduced  to  upper  Hessenberg  form  AH  (aj.j  =  0  if  i-j  £  2)  [33].  This  is  accom¬ 
plished  by  a  finite  sequence  of  similarities  which  can  be  chosen  to  be  of  the  Householder 
form  discussed  above.  The  QR  process  then  yields  a  sequence  of  matrices  which  are  ortho¬ 
gonally  (again,  of  Householder  type)  similar  to  A  and  which  converge  (in  some  sense)  to  a 
so-called  quasi-upper  triangular  matrix  S.  The  matrix  S  is  block  upper  triangular  with 
lxl  blocks  corresponding  to  real  eigenvalues  of  A  and  2x2  blocks  corresponding  to  complex- 
conjugate  pairs  of  eigenvalues.  The  orthogonal  transformations  from  both  the  Hessenberg 
reduction  and  the  QR  process  may  be  accumulated  into  a  single  orthogonal  transformation  U 
so  that 

UTAU  =  S  (17) 

compactly  represents  the  entire  algorithm. 

An  analogous  process  can  be  applied  in  the  case  of  symmetric  A  and  considerable  simpli¬ 
fications  and  specializations  result.  Moreover,  references  [9]  and  [25]  may  be  cor.' 'll*  id 
regarding  an  immense  literature  concerning  stability  of  the  QR  and  related  algoritn;  ?  and 
conditioning  of  eigenvalues  and  eigenvectors.  Both  subjects  are  vastly  more  complex  for 
the  eigenvalue/eigenvector  problem  than  for  the  linear  equation  problem. 

Quality  mathematical  software  for  eigenvalues  and  eigenvectors  has  been  only  recently 
available  -  in  the  1970 's  -  and  the  EISPACK  [10],  [34]  collection  of  subroutines  represents 

a  pivotal  point  in  the  history  of  mathematical  software.  This  collection  is  primarily 
based  on  the  ’’qorithms  collected  in  [35]. 

Closely  related  to  the  QR  algorithm  is  the  QZ  algorithm  [36]  for  the  generalized  eigen¬ 
value  problem 

Ax  =  XMx  (18) 

where  A,  M c  Hnxn.  Again,  a  Hessenberg-like  reduction,  then  an  iterative  process  are 
implemented  with  orthogonal  (Householder )  transformations  to  reduce  (18)  to  the  form 

QAZy  =  XQMZy  (19) 

where  QAZ  is  quasi-upper-triangular  and  QMZ  is  upper  triangular.  For  a  review  and  refer¬ 
ences  to  results  on  stability,  conditioning,  and  software  related  to  (18)  and  the  QZ  al¬ 
gorithm  see  [37].  The  generalized  eigenvalue  problem  is  both  theoretically  and  numerically 
more  difficult  to  handle  than  the  ordinary  eigenvalue  problem  but  it  finds  numerous  appli- 


cations  in  control  and  systems  theory. 

3.3  The  Singular  Value  Decomposition  and  Some  Applications 


One  of  the  basic  and  important  tools  of  modern  numerical  analysis,  particularly 
numerical  linear  algebra,  is  the  singular  value  decompositon.  We  shall  define  it  here  and 
make  a  few  comments  about  its  properties  and  computation  as  well  as  its  significance  in 

various  numerical  problems. 

Singular  values  and  the  singular  value  decomposition  have  a  long  history  particularly 
in  statistics  and  more  recently  in  numerical  linear  algebra.  Even  more  recently  the  ideas 
are  finding  applications  in  the  control  and  signal  processing  literature,  although  their 
use  there  has  been  overstated  somewhat  in  certain  applications.  For  a  survey  of  the  sing¬ 
ular  value  decomposition,  its  history,  numerical  details,  and  some  applications  in  control 
and  systems  theory,  see  [38]. 

The  fundamental  result  can  be  stated  as  follows  for  the  real  case.  For  complex  ma¬ 
trices  the  result  is  virtually  identical  with  complex-conjugate  transposes  replacing  trans¬ 
poses  and  unitary  matrices  replacing  orthogonal  matrices. 

Theorem  1:  Let  Ae  ‘n.  Then  there  exist  orthogonal  matrices  L'  <:  lRr,Xm  and  V  t;  Rn'<n  such 


/  \  a  =  orv 

Is 

where  £  =1  ]  and  S  =  diag (ay , . . . , ar )  with  a  ...  a  r 

\0  0 


The  proof  of  Theorem  1  is  straightforward  and  can  be  found  in,  for  example,  [14]  and 
[39].  Geometrically,  the  theorem  says  that  bases  can  be  found  (separately)  in  the  domain 
and  co-domain  spaces  of  a  linear  map  with  respect  to  which  the  matrix  representation  of 
the  linear  map  is  diagonal. 

The  numbers  ap,...,ar  together  with  ar+i = 0 , . . . ,  cn  =  0  ate  called  the  singular  values 
of  A  and  they  are  the  positive  square  roots  of  the  eigenvalues  of  ATA.  The  columns  u^, 
k  =  l....,n)  of  U  are  called  the  left  singular  vectors  of  A  (the  orthonormal  eigenvectors 
of  AAT)  while  the  columns  (v^,  k  =  l,...,n)  of  V  are  called  the  right  singular  vectors  of 
A  (the  orthonormal  eigenvectors  of  ATA).  The  matrix  A  can  then  also  be  written  (as  a 
dyadic  expansion)  in  terms  of  the  singular  vectors  as  follows: 

r  T 

A  =  k=l  akUkV 

The  matrix  AT  has  m  singular  values,  the  positive  square  roots  of  the  eigenvalues  of  AAT. 
The  r  (=rank  (A))  nonzero  singular  values  of  A  and  AT  are,  of  course,  the  same.  The  choice 
of  aTa  rather  than  AAT  in  the  definition  of  singular  values  is  arbitrary .  Only  the  nonzero 
singular  values  are  usually  of  any  real  interest  and  their  number,  given  the  SVD,  is  the  rank 
of  the  matrix.  Naturally,  the  question  of  how  to  distinguish  nonzero  from  zero  singular 
values  in  the  presence  of  rounding  error  is  a  nontrivial  task. 

It  is  not  generally  advisable  to  compute  the  singular  values  of  A  by  first  finding 
the  eigenvalues  of  aTa  (remember  the  folk  theorem!),  tempting  as  that  is.  Consider  the 
following  example  with  t  a  real  number  with  ]  u  j  <  VT  (so  that  ft(l  +  y2)  =  1  where  fi(-) 

A  i\  A  A 


denotes  floating  point  computation) 


Let  A  = 


Then  f((A  A)  = 


compute  Si  =  /T,  02  =  0  leading  to  the  (erroneous)  conclusion  that  the  rank  of  A  is  1. 

T  A+U2  1 

Of  course,  if  we  could  compute  in  infinite  precision  we  would  find  A  A  =  / 


with  01  =-r2+g2,  =  ! U |  and  thus  rank  (A)  =  2.  The  point  is  that  by  working  with  ATA 

we  have  unnecessarily  introduced  u2  into  the  computations.  The  above  example  illustrates 
a  potential  pitfall  in  attempting  to  form  and  solve  the  normal  equations  in  a  linear  least 
squares  problem  and  is  at  the  heart  of  what  makes  square  root  filtering  so  attractive 
numerically.  See  [40]  for  further  details  and  references. 

Square  root  filtering  is  usually  implemented  using  QR  factorization  (or  some  closely 
related  algorithm)  as  described  previously  rather  than  SVD.  The  key  thing  to  remember  is 
that  in  most  current  computing  environments  the  condition  of  the  least  squares  problem  is 
squared,  unnecessarily,  in  solving  the  normal  equations  and,  moreover,  critical  information 
may  be  lost,  irrecoverably,  by  simply  forming  ATA.  These  caveats  may  not  be  of  such  great 
concern,  however,  if  one  has  available  certain  computing  environments  which  implement, 
for  example,  IEEE  arithmetic  with  extended  length  registers  (e.g.,  the  Intel  8087  floating¬ 
point  processor  chip) . 

Returning  now  to  the  SVD  there  are  two  features  of  this  matrix  factorization  that  make 
it  so  attractive  in  finite  arithmetic:  it  can  be  computed  stably  and  singular  values  are 
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well-conditioned.  Specifically,  there  is  an  efficient  and  numerically  stable  algorithm  t  j 
Golub  and  Reinsch  [41]  (based  on  [39]  )  which  works  directly  on  A  to  give  the  SVD.  The  c  >:r.:  utoi  "  an 
V  are  orthogonal  to  approximately  the  working  precision  and  the  computed  s  i  n;al  a  r  va  1  nos  car.  be 

shown  to  be  the  exact  r .  1  s  for  A+E  where  i s  a  modes t  mu  1 1  i v> le  of  &  .  Fairly  sot:h  i  s  *  i  r a  t  <_■  d  i m- 


shown  to  be  the  exact  7^  1  s  for  A+E  where  w  is  a  modest  multiple  of  &  .  Fa  i  r  1  y  so  j:  h  isM  cm  t  d  im- 

plementations  of  this  algorithm  can  be  found  in  [19]  and  [34],  The  well-conditioned  nature 
of  the  singular  values  follows  from  the  fact  that  if  A  is  perturbed  to  A  +  ?.  then 

|  a  ;  (A  +  E)  -  (A)  I  £  ;;  E . 

Thus  the  singular  values  are  computed  with  small  error  although  the  relative  errors  of 
sufficiently  small  singular  values  is  not  guaranteed  small. 

It  is  now  acknowledged  that  the  singular  value  decomposition  is  the  most  generally 
reliable  method  of  determining  rank  numerically  (see  [31!  for  a  more  elaborate  discussion!. 
However,  it  is  considerably  more  expensive  to  compute  than,  for  example,  the  TP  factoriza¬ 
tion  which,  with  column  pivoting  [19],  can  usually  give  equivalent  information  with  less 
computation.  Thus,  while  the  SVD  is  a  useful  theoretical  tool  its  use  for  actual  computa¬ 
tions  should  be  weighed  carefully  against  other  approaches. 


Only  rather  recently  has  the  problem  of  numerical  ieterm 
understood.  One  of  the  best  treatments  of  the  subject,  inclu 
numerical  rank,  is  a  paper  by  Golub,  Klema,  and  Stewart  [42], 
to  determine  a  "gap"  between  "zero”  and  the  "smallest  nonzero 
A.  Since  the  computed  values  are  exact  for  a  matrix  near  A  l 
rank  of  all  matrices  in  some  5-ball  (w.r.t.  the  spectral  norm 
choice  of  5  may  also  be  based  on  measurement  errors  incurred 
of  A  or  the  coefficients  may  be  uncertain  because  of  roundoff 
computation  to  get  them.  We  refer  to  [42]  for  further  detail 
that  even  with  SVD,  numerical  determination  of  rank  in  finite 
nontrivial  problem. 


ination  of  rank  been  we  1 - 
ding  a  careful  definition  of 
The  essential  idea  is  to  try 
singular  value"  of  a  matrix 
t  makes  sense  to  consider  the 
|j  • ,  say)  around  A.  The 
in  estimating  the  coefficients 
errors  incurred  in  a  previous 
s.  We  must  emphasize,  however, 
arithmetic  is  a  highly 


That  other  methods  of  rank  determination  are  potentially  unreliable  is  demonstrated 
by  the  following  example  which  is  a  special  case  of  a  general  class  of  matrices  studied  by 
Ostrowski  [43].  Consider  the  matrix  A  e  Rnxn  whose  diagonal  elements  are  all  -1,  whose 
upper  triangle  elements  are  all  +1,  and  whose  lower  triangle  elements  are  all  0.  This 
matrix  is  clearly  of  rank  n,  i.e.,  is  invertible.  It  has  a  good  "solid"  upper  triangular 
shape.  All  of  its  eigenvalues  (all  =  -1)  are  well  away  from  zero.  Its  determinant  is 
(_l)n  —  definitely  not  close  to  zero.  But  this  matrix  is,  in  fact,  very  near  singular 
and  gets  more  nearly  so  as  n  increases.  Notice,  for  example,  that 


-1  +1  .  .  .  +1 


2~n+l 


(n-*-  +  =») 


Moreover,  adding  2~n+1  to  every  element  in  the  first  column  of  A  gives  an  exactly  singular 
matrix.  Arriving  at  such  a  matrix  by,  say  Gaussian  elimination,  would  give  no  hint  as  to 
the  near-singularity.  However,  it  is  easy  to  check  that  an(A)  behaves  as  2~n+l.  A 
corollary  for  control  theory:  eigenvalues  don’t  necessarily  give  a  reliable  measure  of 
"stability  margin."  As  an  aside  it  is  useful  to  note  here  that  in  this  example  of  an  in¬ 
vertible  matrix,  the  crucial  quantity,  on  (A) ,  which  measures  nearness  to  singularity,  is 


simply 


and  the  result  is  familiar  from  standard  operator  theory.  There  is  nothing 


intrinsic  about  singular  values  in  this  example  and,  in  fact,  ||A"1||  might  be  more  cheaply 
computed  or  estimated  in  another  matrix  norm.  This  is  precisely  what  is  done  in  estimating 
the  condition  of  linear  systems  in  LINPACK  where  |j  •  ||  j_  is  used  [21]. 

Since  rank  determination,  in  the  presence  of  roundoff  error,  is  a  nontrivial  problem, 
all  the  same  difficulties  will  naturally  arise  in  any  problem  equivalent  to  or  involving 
rank  determination  such  as  determining  the  independence  of  vectors,  finding  subspace  bases, 
etc.  Such  problems  arise  as  basic  calculations  throughout  systems,  control,  and  estimation 
theory.  Selected  applications  are  discussed  in  more  detail  in  [38], 

Finally,  let  us  close  this  section  with  a  brief  example  illustrating  a  totally  in¬ 
appropriate  use  of  SVD.  The  rank  condition 

rank [ B ,  AB, . . . , An_ 1B ]  =  n  (21) 

for  the  controllability  of  (1)  is  (too)  well-known.  Suppose  A  =  ^  lj  ’  B  =  with 

|  u  |  </TT  Then  .  v 

f-  [  B  ,  AB  ]  =  (  | 


and  now  even  applying  SVD  the  erroneous  conclusion  of  uncontrollability  is  reached.  Again 
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the  problem  is  in  just  forming  AB;  not  even  SVD  can  come  to  the  rescue  after  that  numerical 
faux  pas. 

4.  APPLICATIONS 


In  this  section  we  shall  present  a  representative  selection  of  numerical  problems 
which  arise  in  linear  systems,  control,  and  estimation  theory  and  which  have  been  examined 
using  some  of  the  techniques  described  in  Sections  2  and  3.  Some  of  the  topics  are  de¬ 
scribed  in  more  detail  in  [38]  and  [44]  while  still  other  topics  are  surveyed  in  [45], 


4.1  Numerical  Solution  of  Linear  Ordinary  Differential  Equations 


The  "simulation"  or  numerical  solution  of  linear  systems  of  ordinary  differential 
equations  (ODE's)  of  the  form 


x  ( t )  =  Ax  (t)  +  f  ( t )  ;  x  (0  )  =  xQ  (22) 

is  a  standard  problem.  However,  there  is  still  debate  as  to  what  is  the  most  effective 
numerical  algorithm,  particularly  when  A  is  defective  (a  deficiency  of  eigenvectors)  or 
near-defective.  The  most  common  approach  involves  computation  of  the  matrix  exponential, 
etA.  A  delightful  survey  of  this  topic  is  given  in  [5].  Nineteen  "dubious"  ways  are 
explored  (there  exist  many  more  ways  which  are  not  discussed)  but  no  clearly  superior 
algorithm  is  singled  out.  Methods  based  on  Pad!1  approximation  or  reduction  of  A  to  real 
Schur  form  are  seen  as  generally  attractive  while  methods  based  on  Taylor  series  or  the 
characteristic  polynomial  of  A  are  generally  found  to  be  unattractive.  An  interesting 
open  problem  is  the  design  of  a  special  algorithm  for  the  matrix  exponential  when  the 
matrix  is  known  a  priori  to  be  stable  (A (A)  in  LHP). 


The  reason  for  the  adjective  "dubious"  in  the  title  of  [5]  is  that  in  many  (maybe 
even  most)  circun stances ,  it  is  better  to  treat  (22)  as  a  system  of  differential  equations, 
typically  stiff,  and  to  apply  various  ODE  techniques,  specially  tailored  to  the  linear 
case.  This  approach  is  discussed  in  [46].  ODE  techniques  are  to  be  preferred  when  A  is 
large  and  sparse  for,  in  general,  etA  will  be  unmanageably  large  and  dense.  The  relation¬ 
ship  between  ODE  techniques  and  matrix  exponential  techniques  when  A  has  an  ill-conditioned 
eigenstructure  or  when  the  "exponential  problem"  is  ill-conditioned  [47],  [48]  is  not  well- 

understood. 


Controllability  and  Other  "Abilities” 


Basic  to  the  study  of  linear  control  and  systems  theory  are  the  various  "abilities" 
such  as  controllability,  observability,  reachability,  reconstructibi lity ,  stabilizability , 
and  detectability  [1].  Our  remarks  here  will  be  confined,  but  are  not  limited,  to  the 
notion  of  controllability. 


A  large  number  of  algebraic  and  dynamic  characterizations  of  controllability  have 
been  given;  see  [44]  for  a  sample.  But  each  and  every  one  of  these  has  difficulties  when 
implemented  in  finite  arithmetic.  For  a  survey  of  this  topic  and  numerous  examples  see 
[49],  Part  of  the  difficulty  in  dealing  with  controllability  numerically  lies  in  the 
intimate  relationship  with  the  invariant  subspace  problem  [31].  The  controllable  subspace 
associated  with  (1)  is  the  smallest  A-invariant  subspace  containing  the  range  of  B.  Since 
A-invariant  subspaces  are  extremely  sensitive  to  perturbation,  it  follows  that  so  too  is 
the  controllable  subspace.  Similar  remarks  apply  to  the  computation  of  the  so-called  con¬ 
trollability  indices. 


Recently,  attempts  have  been  made  to  provide  numerically  stable  algorithms  for  the 
pole  placement  problem;  we  would  cite  [50]- [53]  as  examples  (only  one  representative  and 
recent  reference  is  chosen  for  each  author  or  group) .  The  methods  are  based  on  reduction 
of  A  to  a  Hessenberg  form  rather  than  a  controllable  or  Luenberger  canonical  form  which 
is  known  to  be  numerically  unstable  [9].  For  example,  in  the  single-input,  single-output 
case  it  can  easily  be  shown,  using  the  tools  developed  in  Section  3.1,  that  there  exists 
an  orthogonal  transformation  U  such  that  uTau  is  upper  Hessenberg  and  UTB  =  a  multiple  of 
(1,0, . . . ,0)T.  The  pair  (A,B)  is  then  controllable  if  and  only  if  all  (n-1)  subdiagonal 
elements  of  uTau  are  nonzero.  If  a  subdiagonal  element  is  0,  the  system  is  uncontrollable 
and  a  basis  for  the  uncontrollable  subspace  is  easily  constructed.  The  transfer  function 
gain  or  first  nonzero  Markov  parameter  is  also  easily  constructed  from  this  "canonical 
form."  In  fact,  the  numerically  more  robust  general  Hessenberg  form  will  probat ly  play 
an  ever-increasing  role  in  systems  theory  in  replacing  the  numerically  more  fragile 
special  case  of  the  companion  or  rational  canonical  or  Luenberger  canonical  form. 


A  more  important  aspect  of  controllability  is  to  better  understand  topological  notions 
such  as  "near-uncontrollability."  But  there  are  numerical  difficulties  lurking  here,  also, 
and  we  refer  to  [49]  for  further  details.  Related  to  this  is  an  interesting  new  system- 
theoretic  concept  called  "balancing";  see  [54].  The  computation  of  "balancing  transform¬ 
ations"  is  discussed  in  [55], 


There  are  at  least  two  distinct  notions  of  near-uncontrollability:  in  the  parametric 
sense  and  in  the  energy  sense.  In  the  parametric  sense  a  controllable  pair  (A,B)  is 
said  to  be  near-uncontrollable  if  the  parameters  of  (A,B)  need  be  perturbed  by  only  a 
relatively  small  amount  to  become  uncontrollable.  In  the  energy  sense,  a  controllable 
pair  is  near-uncontrollable  if  large  amounts  of  control  energy  (  /u”u)  are  required  to 
effect  a  state  transfer.  The  pair 
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is  near-uncontrollable  in  the  energy  sense  but  apparently  not  in  the  parametric  sense.  Of 
course,  both  measures  are  co-ordinate  dependent  and  "balancing"  is  one  attempt  to  try  to 
remove  this  co-ordinate  bias. 

4. 3  Computation  of  Objects  Arising  in  the  Geometric  Theory  of  Linear  Multivariable  Control 

A  great  many  numerical  problems  arise  in  the  geometric  approach  [1]  to  control  of 
systems  modeled  as  (1),  (2).  Some  of  these  are  discussed  in  [38]  and  [1]  remains  a 
fertile  source  of  numerical  problems.  In  fact,  the  power  of  the  geometric  approach  derives 
in  large  part  from  its  divorce  from  matrices  and  specific  co-ordinate  systems.  Numerical 
issues  are  a  separate  concern.  Two  of  the  more  elaborate  but  still  fundamental  objects 
in  that  theory  are  supremal  (A,B)  -  invariant  and  controllability  subspaces  contained  in 
a  given  subspace.  An  initial  attempt  at  characterizing  these  spaces  in  terms  of  eigen¬ 
vectors  and  a  generalized  eigenvalue  problem  was  given  in  [56], 

However,  a  rather  different  and  very  thorough  numerical  treatment  of  the  problem  has 
been  done  by  Van  Dooren  [57],  [58].  He  has  done  the  most  definitive  numerical  study  to 
date  of  the  matrix  pencil  (L-lM)  problem.  This  work  has  implications  for  most  calculations 
done  with  linear  state-space  models.  For  example,  one  by-product  is  an  extremely  reliable 
algorithm  (which  amounts  to  an  orthogonal  version  of  Silverman's  structure  algorithm)  for 
the  computation  of  multivariable  system  zeros  [59],  Like  [37]  this  method  involves  a 
generalized  eigenvalue  problem  (the  Rosenbrock  pencil)  but  the  "infinite  zeros"  are  first 
deflated  out. 

4 . 4  Frequency  Response  Calculations 

Many  properties  of  a  linear  system  (1),  (2)  are  known  in  terms  of  its  frequency 

response  matrix 

G  ( j  to)  :=  C(jtol-A)-1  B  +  D;  (toiO)  (23) 

(or  G(e39);  ee[0,  2tt ]  for  (3),  (4)).  In  fact,  various  norms  of  the  return  difference 

matrix  I  +  G(jco)  and  related  quantities  have  recently  been  investigated  as  providing  meas¬ 
ures  of  robustness  of  a  linear  system  with  respect  to  stability,  noise  response,  distur¬ 
bance  attenuation,  sensitivity,  etc.  See  [60]  for  some  of  the  numerical  aspects  and  [61], 
[62]  for  surveys  of  some  of  the  control  aspects. 

It  is  thus  a  problem  of  considerable  computational  interest  to  efficiently  compute 
G ( jw) ,  given  A , B ,  and  C,  for  a  (possibly)  large  number  of  values  of  w  (without  loss  of 
generality,  D  can  be  taken  to  be  0) .  A  generally  applicable  algorithm  for  this  problem  is 
presented  in  [63],  Rather  than  solve  the  linear  equation  (with  dense,  unstructured  A) 
(jwI-A)X  =  B  which  would  require  0(mn2)  operations  for  each  value  of  w,  the  new  method  does 
an  initial  reduction  of  A  to  upper  Hessenberg  form,  H.  The  orthogonal  matrices  used  to 
effect  the  Hessenberg  form  of  A  are  incorporated  into  B  and  C  giving  B  and  C.  Now  as  w 
varies,  the  coefficient  matrix  in  the  linear  equation  (jwI-H)  X  =  B  remains  in  upper 
Hessenberg  form.  The  advantage  is  that  X  can  now  be  found  in  0(mn2)  operations  rather 
than  O(mn^)  as  before,  a  substantial  savings.  Moreover,  the  method  is  numerically  very 
stable  and  has  the  advantage  of  being  independent  of  the  eigenstructure  (possibly  ill- 
conditioned)  of  A. 

Portable  mathematical  software,  in  the  sense  to  be  discussed  in  Section  5,  is  also 
available  for  this  problem  [64]. 

We  note  here  that  the  above  method  can  also  be  extended  to  state-space  models  in  im¬ 
plicit  form,  e.g.,  (1)  is  replaced  by 


Ex  =  Ax  +  Bu. 


(24) 


Then  (23)  is  replaced  with 

G(jco)  =  C  ( jwE  -  A)-1  E  +  D  (25) 

and  the  initial  triangular/Hessenberg  reduction  employed  in  [36]  can  be  employed  to  again 
reduce  the  problem  to  one  of  updating  the  diagonal  of  a  Hessenberg  matrix  and  consequently 
an  0(n2)  linear  equation  problem. 

4 . 5  Lyapunov,  Sylvester,  and  Riccati  Equations 

Certain  matrix  equations  arise  naturally  in  linear  control  and  systems  theory.  Among 
those  frequently  encountered  in  the  analysis  of  continuous-time  systems  are  the  Lyapunov 

equation 

FX  +  XFT  +  H  =  0,  (26> 


It  -  -J 


I  -1 


and  the  Sylvester  equation 


FX  +  XG  +  H  =  0. 


(27) 


X  +  H 


0 


(28) 


The  appropriate  discrete-time  analogues  are 

T 

FXF  - 

FXG  -  X  +  H  =  0.  (29) 

Various  hypotheses  are  made  on  the  coefficient  matrices  F,  G,  H  to  ensure  certain  properties 
of  the  solution  X. 

Surprisingly  little  attention  has  been  paid  to  solution  of  these  equations  in  the 
numerical  linear  algebra  literature.  There  is,  however,  a  voluminous  literature  in  control 
and  systems  theory  but  most  of  that  is  ad  hoc,  at  best,  from  a  numerical  point  of  view, 
with  little  attention  paid  to  questions  of  numerical  stability,  conditioning,  machine 
implementation,  and  the  like. 

For  the  Lyapunov  equation  the  overall  best  algorithm  in  terms  of  efficiency,  accuracy, 
reliability,  availability,  and  ease  of  use  appears  to  be  that  of  Bartels  and  Stewart  [65], 
The  basic  idea  is  to  reduce  F  to  quasi-upper-triangular  form  (or  real  Schur  form  (RSF ) )  and 
perform  a  back  substitution  for  the  elements  of  X. 

For  the  Sylvester  equation  the  Bartels-Stewart  algorithm  reduces  both  F  and  G  to  real 
Schur  form  (RSF)  and  then  a  back  substitution  is  done.  It  has  been  demonstrated  in  [66] 
that  some  improvement  in  this  procedure  is  possible  by  only  reducing  the  larger  of  F  and  G 
to  upper  Hessenberg  form. 

A  promising  new  algorithm  for  solving  Lyapunov  equations  has  recently  been  proposed  by 
Hammarling  [67].  This  algorithm  is  a  variant  of  the  Bartels-Stewart  algorithm  which  solves 
directly  for  the  Cholesky  factor  Y  of  X  :  YTY  =  X  and  Y  is  upper-triangular.  Clearly, 
given  Y,  X  is  easily  recovered  if  necessary.  But  in  many  applications,  for  example  [55], 
only  the  Cholesky  factor  is  required. 

Open  questions  remain  concerning  estimating  the  condition  of  Lyapunov  and  Sylvester 
equations  efficiently  and  reliably  in  terms  of  the  coefficient  matrices. 

A  deeper  analysis  of  the  Lyapunov  and  Sylvester  problems  is  probably  a  prerequisite 
to  at  least  a  better  understanding  of  conditioning  of  the  Riccati  equation  for  which  again, 
there  is  a  considerable  theoretical  literature  but  rather  little  known  from  a  purely  nu¬ 
merical  point  of  view.  The  symmetric  algebraic  Riccati  equation  takes  the  form 

XGX  +  FX  +  XFT  +  H  =  0  (30) 

for  continuous- time  systems  and 

FXFT  -  X  -  FXG1(G2  +  G^XGj^)  _1G^XFT  +  H  =  0  (31) 

for  discrete-time  situations.  Again,  appropriate  assumptions  are  made  on  the  coefficient 
matrices  to  guarantee  the  existence  and/or  uniqueness  of  certain  kinds  of  solutions  X. 
Nonsymmetric  Riccati  equations  of  the  form 

XGX  +  Fj^X  +  XF2  +  H  =  0  (32) 

for  the  continuous-time  case  (along  with  an  analog  for  the  discrete-time  case)  are  also 
studied  and  can  be  solved  numerically  by  the  techniques  discussed  below. 

One  of  the  more  reliable  general-purpose  methods  for  solving  Riccati  equations  is  the 
Schur  method  [68].  For  the  case  of  (30),  for  example,  this  method  is  based  upon  the  reduc¬ 
tion  of  the  associated  Hamiltonian  matrix 


to  RSF.  If  the  RSF  is  ordered  so  that  its  stable  eigenvalues  (there  will  be  exactly  n  of 
them  under  certain  assumptions)  are  in  the  upper  left  triangle,  the  corresponding  first  n 
vectors  of  the  orthogonal  matrix  which  effects  the  reduction  will  form  a  basis  for  the 
stable  eigenspace  from  which  the  Riccati  solution  is  then  easily  found. 

Extensions  to  the  basic  Schur  method  have  been  made  [69],  [70]  which  were  motivated 

by  the  following  situations: 

—  1  T 

(i)  G  in  (30)  is  of  the  form  BR  B  where  R  may  be  near-singular. 

(ii)  F  in  (31)  is  singular  (F~l  is  required  in  the  classical  approach  involving 
a  symplectic  matrix  which  plays  the  role  of  (33)). 

In  fact,  these  extensions  can  be  generalized  even  further  and  the  following  problem  will 
illustrate.  Consider  the  optimal  control  problem 

Min  i  /+  °°  [xV  +  2xTSu  +  uTRu]dt  (34) 

o 

subject  to  Ex  =  Ax  +  Bu.  (35) 

The  Riccati  equation  associated  with  (34),  (35)  then  takes  the  form 


0 


T— IT  — ITT  T  —IT  — IT 

E  XBR  B  XE  -  (A  -  BR  S  )  XE  -  E  X  (A  -  BR  S  )  -  Q  +  SR  S  = 


(36) 


This  so-called  "generalized"  Riccati  equation  can  be  solved  by  considering  the  associated 
matrix  pencil 


-A 

B1 


0 


(37) 


Notice  that  S  in  (34)  and  E  in  (35)  are  handled  directly  and  no  inverses  appear.  Numerical 
methods  for  handling  \37)  and  a  large  variety  of  related  problems  are  described  in  [70], 

[71]  and  a  thorough  survey  of  the  Schur  method,  generalized  eigenvalue/eigenvector  exten¬ 
sions,  and  the  underlying  algebraic  structure  in  terms  of  " Hami 1 tonian  pencils"  and  "sym- 
plectic  pencils"  is  described  in  [72]. 


Schur  techniques  can  also  be  applied  to  Riccati  differential  and  difference  equations 
[73]  and  to  nonsy,umetr ic  Riccati  equations  which  arise  in,  for  example,  invariant  imbedding 
methods  for  solving  linear  two-point  boundary  value  problems  [n2]. 


As  with  the  linear  Lyapunov  and  Sylvester  equations  there  are  few  satisfactory  results 
concerning  conditioning  of  Riccati  equations,  a  topic  of  great  interest  independent  of  what 
solution  method  is  used,  be  it  a  Schur-type  method  or  one  of  numerous  alternatives.  One 
fairly  reliable  method  is  to  estimate  the  condition  of  Un  with  respect  to  inversion  where 

j  is  a  basis  for  the  stable  eigenspacu.  This  turns  out  to  be  essentially  equivalent 

to  the  somewhat  more  elaborate  procedure  proposed  in  [74].  But  it  is  easy  to  provide 
examples  of  ill-conditioned  Riccati  equations  where  Un  is  wel 1 -conditioned  and  so  a  much 
more  sophisticated  analysis  needs  to  be  performed  and  efforts  are  under  way  by  numerous 
groups  towards  this  end. 


A  software  package  for  Riccati  equations  called  RICPACK  has  been  partially  completed 
(Jan.  1983)  by  this  author  and  W.F.  Arnold.  Highlights  of  the  capabilities  of  this  general 
software  (in  Fortran)  include: 


(i)  Ward's  balancing  [75]  for  the  generalized  eigenvalue  problem 

(ii)  direct  handling  of  singular  control  weighting  or  measurement  noise  covariance 

(iii)  direct  handling  of  cross-weighting  or  noise  correlation 

(iv)  direct  handling  of  descriptor  variable  systems 

(v)  spectral  factorization 

(vi)  iterative  refinement  by  Newton's  method  and  Sylvester  equations 

(vii)  residual  calculations  and  condition  estimates 

5 .  MATHEMATICAL  SOFTWARE 

5.1  General  Remarks 


The  previous  two  sections  have  highlighted  some  topics  from  numerical  linear  algebra 
and  their  applications  to  numerical  problems  arising  in  systems,  control,  and  estimation 
theories.  Of  course,  these  problems  represent  only  a  very  small  subset  of  numerical 
problems  of  interest  but  even  for  problems  which  are  apparently  "simple"  from  a  mathematical 
point  of  view,  the  myriad  of  little  details  which  constitute  a  sophisticated  implementation 
become  so  overwhelming  that  the  only  effective  means  of  communicating  an  algorithm  is 
through  its  embodiment  as  mathematical  software.  Mathematical  or  numerical  software  simply 
means  an  implementation  on  a  computing  machine  of  an  algorithm  for  solving  a  mathematical 
problem.  Such  software  must  be  reliable,  portable,  and  unaffected  by  the  machine  or 
system  environment  in  which  it  is  used. 

The  prototypical  work  on  reliable,  portable  mathematical  software  for  the  standard 
eigenproblem  was  started  in  1968.  EISPACK  [10],  [34]  Editions  I  and  II  were  an  outgrowth 

of  that  work.  Subsequent  efforts  of  interest  to  control  engineers  include  LINPACK  [19]  for 
linear  equations  and  linear  least  squares  problems,  FUNPACK  (Argonne)  for  certain  function 
evaluations,  MINPACK  (Argonne)  for  certain  optimization  problems,  ROSEPACK  for  robust 
statistical  estimation,  and  various  ODE  and  PDE  codes.  High  quality  algorithms  are  pub¬ 
lished  regularly  in  the  ACM  Transactions  on  Mathematical  Software. 

Moreover,  many  pre-processors  that  are,  themselves,  portable  software  have  been  de¬ 
signed  and  implemented  to  assist  in  instituting  and  veri tying  portability.  Among  such 
machine  aids  that  are  in  use  are  the  PFORT  verifier  [76]  f . om  Bell  Labs,  and  the  Fortran 
Converter  from  International  Mathematical  and  Statistical  Libraries,  Inc.  [77],  Also 
available  are  machine  aids  such  as  TAMPR  and  POLISH.  Technology  to  aid  in  the  development 
of  mathematical  software  in  Fortran  is  being  assembled  as  a  package  called  TOOLPACK 
(University  of  Colorado).  Mechanized  code  development  offers  other  advantages  with  respect 
to,  for  example,  modifications,  updates,  versions,  and  maintenance.  Excellent  references 
on  portability  and  other  aspects  of  mathematical  software  include  [  7  8 ] — [ 80  ]  . 

Inevitably  numerical  algorithms  are  strengthened  when  their  mathematical  software  is 
made  portable  since  their  widespread  use  is  greatly  facilitated.  Furthermore,  such  soft¬ 
ware  has  been  shown  to  be  markedly  faster  by  factors  ranging  from  10  to  50  than  earlier 
and  less  reliable  code. 


One  can  list  many  other  features  besides  portability,  reliability,  and  efficiency 
which  are  characteristic  of  "good"  mathematical  software.  For  example,  one  can  include: 

(i)  high  standards  of  documentation  and  style 

(ii)  ease  of  use;  ability  of  the  user  to  interact  with  the  algorithm 

(iii)  consistency/compatibility/modularity  in  the  context  of  a  larger  package 
or  more  complex  problem 

(iv)  error  control,  exception  handling 

(v)  robustness  with  respect  to  unusual  situations 

(vi)  graceful  performance  degradation  as  problem  domain  boundaries  are  approached 

(vii)  program  size  (a  function  of  intended  use:  e.g.  low  accuracy,  real-time 
applications) 

(viii)  availability  and  maintenance 

(ix)  "tricks"  such  as  under flow-/over flow-  proofing  if  necessary  and  implementa¬ 
tion  of  columnwise  or  rowwise  linear  algebra  [81]. 

Clearly,  the  list  can  go  on. 

What  becomes  apparent  from  the  above  considerations  is  that  the  evaluation  of  mathemat 
ical  software  is  a  highly  nontrivial  task.  Clearly  the  quality  of  software  is  largely  a 
function  of  its  operational  specification.  It  must  also  reflect  the  numerical  aspects  of 
the  algorithm  being  implemented.  The  language  used  and  the  compiler  (e.g.,  optimizing  or 
not)  used  for  that  language  will  both  have  an  enormous  impact  on  quality  -  both  perceived 
and  real  as  will  the  underlying  hardware  and  arithmetic.  Different  implementations  of  an 
algorithm  can  have  markedly  different  properties  and  behavior  —  even  of  the  same  good 
underlying  algorithm.  Further  discussions  on  this  subject  can  be  found  in  (79],  [80]  and 
(82]. 

5.2  Mathematical  Software  in  Control 

Many  aspects  of  systems,  control,  and  estimation  theory  are  ready  for  the  research  and 
design  that  is  necessary  to  produce  reliable,  portable  mathematical  software  that  performs 
in  finite  arithmetic.  Certainly  many  of  the  underlying  linear  a!  jebra  tools  (for  example, 
in  EISPACK  and  LINPACK)  are  considered  sufficiently  reliable  as  to  be  used  as  black  —  or  at 
least  gray  —  boxes  by  control  engineers.  Much  of  that  theory  and  methodology  can  and  has 
been  carried  over  to  control  problems.  However,  much  of  the  work  done  in  control,  partic¬ 
ularly  the  design  and  synthesis  aspects,  is  not  amenable  to  nice,  "clean"  algorithms  and 
the  ultimate  software  must  have  the  capability  to  enable  a  dialogue  between  the  computing 
machine  and  the  control  engineer  but  with  the  latter  probably  still  making  the  final  en¬ 
gineering  decisions.  We  might  never  see  a  "control  package”  (  CONPACK)  that  will  look 
like  EISPACK  or  LINPACK.  To  even  attempt  it  would  be  a  CONJOB.  Instead,  a  better  analogy 
would  be  made  by  trying  to  emulate  a  good  ODE  or  PDE  package. 

What  mathematical  software  can  provide  is  a  "toolbox"  from  which  the  control 
engineer  can  choose  software  tools  and  robustly  coded  algorithms  to  easily  implement  new 
or  modified  theories  or  designs.  Mathematical  software  forms  the  foundation  of  a  computer- 
aided  control  system  design  (CACSD)  package  but  is  only  one  of  many  interlocking  parts. 

Most  CACSD  packages  —  and  there  are  now  hundreds  of  them  —  divide  fairly  naturally 
into  two  fundamental  levels,  each  with  further  subdivisions,  of  course.  The  lower  level 
contains  the  numerical  software  and  this  can  be  written  very  portably.  The  upper  level 
contains  the  basic  design  and  analysis  procedures  which  call  the  low  level  procedures  or 
subroutines  for  their  actual  implementation.  Also  in  the  upper  level  is  the  key  part,  as 
far  as  the  control  engineer  is  concerned, and  that  is  the  user  interface.  This  interface 
interacts  with  the  upper  level  procedures  as  well  as  the  1/0,  graphics,  and  file  and  data¬ 
base  management  systems.  Here  the  question  of  portability  is  considerably  more  complex 
and  most  packages  aim  for  particular  "target  environments".  For  further  comments  and 
examples  see  [83],  [84]. 

Finally,  we  mention  a  few  recent  developments  which  are  relevant  to  future  control 
and  estimation  software  developments.  With  respect  to  languages,  Fortran  is  likely  to 
remain  the  most  common  (and  very  efficient)  language  for  quite  some  time.  Fortran  coding 
and  portability  is  certain  to  be  aided  by  the  adoption  and  use  of  the  Fortran  77  standard 
(ANSI  X3. 9-1978)  and  work  continues  on  Fortran  8X.  We  may  also  see  some  movement  towards 
Ada™.  The  use  of  other  languages  such  as  Pascal,  C,  PL1,  and  APL  seems  generally  limited 
despite  their  attractiveness  in  particular  environments.  In  hardware  we  are  seeing  more 
and  more  microprocessor-based  systems.  Parallel  architectures  will  both  demand  and 
suggest  new  algorithms  and  control  strategies.  In  arithmetic  the  implementation  of  IEEE 
arithmetic  in  some  computing  environments  could  have  a  major  beneficial  impact  on  mathe¬ 
matical  software.  In  graphics  substantial  progress  is  being  made  towards  development  of 
standards. 


6.  CONCLUDING  REMARKS 

Some  numerical  issues  and  techniques  from  numerical  linear  algebra  together  with  some 
applications  of  these  ideas  have  been  outlined.  A  key  question  in  these  and  other  problems 
in  systems,  control,  and  estimation  theory  is  what  can  be  reliably  computed  and  used  in  the 
presence  of  parameter  uncertainty  or  structural  constraints  (e.g.,  certain  "hard  zeros') 
in  the  original  model  and  roundoff  errors  in  the  calculations.  However,  the  ultimate  goal 
is  to  solve  real  problems  and  reliable  tools  (mathematical  software)  and  experience  must 
be  available  to  effect  real  solutions  or  strategies.  Only  a  serious  interdisciplinary 

Ada  is  a  trademark  of  the  U.S.  Department  of  Defense 


effort  is  capable  of  making  substantial  progress  in  improving  the  present  state  of  aft  i::s. 
As  we  move  out  of  the  "just  programming"  era  we  can  expect  to  see  soon  some  high  quality 
control  software.  We  have  already  witnessed  a  fruitful  symbiosis  between  n-.inu-i  1  al  .mays  i  s 
and  numerical  problems  from  control.  We  can  expect  a  further  symbiotic  relationshi;  as 
control  engineering  realizes  the  full  potential  of  graphics,  “cheap"  memory,  an:  substantial 
computing  power.  However,  as  in  other  applications  areas,  software  will  continue  t ,  > 
dominate  both  as  a  constraint  and  as  a  vehicle  for  progress.  Unfortunately,  exceptionally 
high  quality  software  is  exceptionally  expensive,  in  terms  of  both  money  an:  time. 
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SUMMARY 

A  recent  formulation  of  the  feedback  control  problem  has  been  proposed  whicn  captures  botn 
performance  and  robustness  aspects  of  feedback.  The  structured  singular  value  provides  the  solution  to 
this  problem.  This  paper  reviews  the  new  problem  formulation  and  solution  and  extends  its  applicaoi lity 
to  digital  feedback  control  systems.  A  digital  compensator  is  treated  as  though  it  were  an  analog 
compensator  through  the  use  of  sectors.  An  integrated  flight-propulsion  control  system  is  used  as  an 
illustrative  example. 

1.  INTRODUCTION 

The  basic  requirement  of  feedback  systems  is  to  achieve  certain  desired  levels  of  performance  and 
also  to  be  tolerant  of  uncertainties.  Performance  levels  concern  such  things  as  command  following, 
disturbance  rejection,  sensitivity,  etc.,  while  uncertainty  tolerances  deal  witn  the  inevitaole 
differences  which  exist  between  a  physical  plant  and  its  mathematical  model.  As  discussed  in  various 
textbooks  and  references,  these  two  aspects  of  the  feedback  problem  lead  to  fundamental  tradeoffs  and 
compromises  which  motivate  the  entire  body  of  feedback  theory.  Incorporation  of  a  digital  computer  does 
not  alter  this  basic  requirement. 

An  essential  difficulty  in  the  theory  has  been  to  capture  both  the  performance  and  uncertainty 
aspects  of  feedback  in  a  single  problem  statement.  Thus  we  have  optimization  theories  which  emphasize 
performance,  robustness  theories  which  emphasize  uncertainties,  and  a  host  of  ad  hoc  tools  whicn  attempt 
to  compromise  the  two.  A  recent  problem  formulation  has  been  proposed  which  captures  both  aspects  of 
feedback  under  the  umbrella  of  what  is  called  the  "block-diagonal  bounaed  perturbation  proDlem."  Tne 
solution  to  this  problem  involves  a  generalization  of  the  ordinary  singular  value  decomposition.  It 
provides  a  reliable,  nonconservative  measure  to  determine  whether  both  the  performance  and  robustness 
requirements  of  a  feedback  loop  are  satisfied.  This  measure  is  called  the  structured  singular  value  and 
serves  as  the  essential  analysis  tool.  One  of  the  major  goals  of  this  paper  is  to  disseminate  this 
problem  formulate. 

The  second  goal  of  this  paper  is  to  expound  a  metnod  of  analysis  for  digital  control  systems  that 
uses  the  theory  of  sectors.  When  used  in  a  feedback  configuration,  a  digital  computer  is  embedded  in  a 
compensator  that  consists  of  a  prefilter,  a  sampler,  the  computer,  and  a  hold  device.  Al enough  such  a 
compensator  has  a  digital  components,  it  transforms  an  analog  input  signal  into  an  analog  output 
signal.  An  accurate  model  of  its  input/output  behavior  requir-  s  an  analog,  time-varying  operator.  Such 
a  model  is  too  complicated,  however,  for  easy  use  in  control  system  design  and  analysis.  The  theory  of 
sectors  provides  linear,  time-invariant  approximations  for  digital  controllers.  A  fundamental  theorem 
of  Thompson[l]  is  used  extensively  in  this  development. 

The  basic  idea  of  sectors  is  that  a  very  complicated  operator  can  be  reliably  approximated  by  a 
simple  "center",  provided  that  the  approximation  error  is  properly  accounted.  The  "radius"  of  the 
sector  bounds  the  errors  of  this  approximation.  Finite-dimensional,  linear,  time-invariant  operators 
will  be  presented  for  the  center  and  radius  of  a  sector  that  describes  a  quite  general  digital 
compensator.  The  error  introduced  by  this  approximation  is  in  addition  to  the  usual  modeling  error 
associated  with  the  controlled  plant.  The  use  of  sectors  together  with  the  structured  singular  value 
successfully  addresses  simultaneous  performance  and  robustness  issues. 

The  paper  is  organized  Into  seven  major  sections.  Section  2  provides  a  review  of  some  requisite 
background  material.  Section  3  shows  how  to  use  conic  sectors  In  the  analysis  of  hybrid  compensators. 

Yhfs  work  has  been  supported  by  Honeywell  Internal  Research  and  Development  Funding,  The  Office  of 
Naval  Research  under  ONR  Research  Grant  N00014-82-C-0157,  and  the  U.S.  Air  Force  Office  of  Scientific 
Research  Grant  F 49620-82 -C-0090. 
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In  Section  4,  the  robustness  and  performance  aspects  of  feedback  are  formulated  as  a  stability  problem 
involving  block-diagonal  bounded  perturbations.  This  problem  is  solved  in  Section  b  using  the  new 
structured  singular  value  concept.  These  ideas  are  illustrated  through  the  example  problem  of  an 
integrated  control  mode  for  an  advanced  fighter  aircraft  in  Section  6.  Section  7  contains  concluding 
remarks. 

2.  MATHEMATICAL  PRELIMINARIES 
2.1  Topics  from  Functional  Analysis 

The  foundation  for  the  development  in  this  paper  is  provided  by  a  review  of  a  number  of  basic 
concepts  from  functional  analysis.  The  first  of  these  is  the  normed  linear  space  L2xm  .  This 
function  space  is  the  collection  of  all  rxm-dimensional  functions  which  are  square  integrable  on  R.  An 
inner  product  for  any  two  functions  x  and  y  in  L2xm  is  defined  as 

u 

<x,y>  =  t  Tr[y"(t)x(t)]dt  (2.1) 

The  norm  associated  with  this  inner  product  is 


11X11  _  =  <x,x> 


Elements  of  L2  have  finite  norm. 


The  chief  limitation  of  the  space  L™  for  control  system  analysis  is  that  it  contains  no 
unstable  functions,  i.e.,  functions  with  « x« 2  =  ».  This  can  be  remedied  by  introducing  the 
extended  normed  linear  space  This  space  is  the  collection  of  all  functions  which  are  square 

integrable  on  all  finite  intervals  of  R.  More  precisely,  we  introduce  the  truncated  norm  as 


iixii -  ,  =  {/  Tr[xH(t)x(t)]dt)  1/2 

-T 


Then  L,  contains  all  functions  x:R-C  which  satisfy  “xh,  <  -  for  all  t. 

L  1  C,T 

Functions  such  as  x(t)-e  are  included  in  L^,  for  example,  while  functions  sucn  as  x(t)  =  taint) 


are  not.  Note  that  all  elements  of  L~xm  are  included  in  the  extension  L™m 

c  Ze 


and  have  the 


property  that  i*xn 2  T  *  «xn2  as 


The  other  function  space  we  will  need  is  L_  .  This  space  consists  of  all  functions  which 
are  measurable  and  essentially  bounded.  It  is  a  normed  linear  space  witn  norm 

nxn—  *  ess  sup  o  (x(t))#  (2.4 

No  distinction  will  be  made  between  functions  differing  over  sets  of  measure  zero  and  in  the  sequel  we 
will  use  sup  for  the  essential  supremum. 

The  Fourier  transform  of  a  function  x  is 

A  *♦»  • 

x(j«)  =  /  x(t)e’Jwtdt  (2.5 


The  operation  of  the  Fourier  transform  is  a  linear  isometry  of  L™1"  onto  L™'"  .  The 
Parseval  formula  relates  inner  products 


and  specializes  to 


<x,y>  1  -j,  <x.  y> 


2  1*2 
"x"2  *  2*  "x"  2 


An  operator  G  is  a  mapping  which  associates  which  each  function  in  its  domain  exactly  one  function 
in  its  range.  For  our  purposes,  the  domain  of  an  operator  will  be  L2e  and  its  range  will  be  some 

I  the  symbol  o(*)  denotes  the  maximum  singular  value  of  a  matrix.  The 
singular  values  of  a  matrix  A  are  the  non-negative  square  roots  of  the 
eigenvalues  of  the  Hermitian  form  AHA. 


subset  of  L~ 


The  mapping  of  an  operator  is  denotea 

y  =  Gx 


(2.8) 
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It  is  assumed  that  G  is  a  causal  operator.  Causality  means  that  the  output  of  G  at  time  t,  does  not 
depend  on  values  of  the  input  at  future  times,  say  t^t^. 

m  r 

An  operator  has  a  norm  induced  by  the  norms  on  |_2e  and  L^.  The  induced  operator  norm  is 

iiGxii 

hGh?  =  sup  sup  -jrnj — —  (2.9) 

2  t  hx«2>t=0  "x  2,t 

It  is  a  common  abuse  of  notation  to  use  n * n ^  to  denote  both  function  norms  and  the  induced  operator 
norm.  The  distinction  is  made  clear  by  the  arguments  used  witn  the  symbol. 

The  causal  operator  G  is  said  to  be  L?  staole  if  it  has  finite  gain,  i.e.,  »G“2  <  -. 

m  r 

Thus  L2e  stable  operators  map  L2  into  L2> 

He  now  restrict  attention  to  linear  operators  G  defined  by  the  convolution  inteyral 

m 

( Gx ) ( t )  =  ;  g(t-T)  x(t )dx  (2. 1 U) 

where  g  is  absolutely  integraDle.  Fubini's  theorem  provides  a  relationship  between  the  Fourier 
transform  of  y=Gx  and  the  transforms  of  g  and  x, 

y  =  gx  (2.11) 

We  refer  to  g  as  the  transfer  matrix  of  the  operator  G.  The  Fourier  transform  converts  the  original 
convolution  into  multiplication  in  the  transformed  (frequency)  domain.  Moreover,  the  Fourier  transform 
relates  the  norm  of  an  L2e  stable  operator  G  to  the  transform  g.  By  the  Parseval  formula  (2.7), 

nGxii  ->  »gxn2 

"G"p  =  sup  ITT-  =  sup  -  (2J2) 

mxm2j<  0  2  « x n 2 ^  o"xn2 

In  fact,  it  can  be  shown  [6]  that 

iiG'i 2  =  iign^  =  sup  o  (g(j«))  (2.13) 

Thus  for  stable  convolutional  operators,  the  norm  is  simply  equal  to  the  supremum  over  frequency  of  the 
largest  singular  value  of  the  transfer  matrix.  This  makes  convolutional  operators  into  a  normed  linear 
algebra. 


2.2  Performance  Measures  for  Linear  Systems 

Two  alternate  measures  of  performance  for  linear  systems  are  related  to  tne  basic  concepts  of 
functional  analysis  in  this  section.  Broadly  speaking,  "good"  performance  means  that  some  error 

response  is  "small"  in  an  appropriate  sense.  An  example  of  such  an  error  response  is  the  classical 

output  sensitivity  function  of  a  feedback  loop  which  relates  command  following  errors  to  output 
commands.  Another  example  is  the  response  at  the  input  of  a  plant  to  sensor  noise  on  the  measured 

variable.  We  will  characterize  an  error  response  as  an  operator  and  measure  its  size  through  norms  on 

m  r 

the  operator.  The  operator  is  denoted  G  and  maps  inputs  u  in  L?  into  outputs  y  in  l2> 

The  first  type  of  performance  considered  is  stochastic  performance,  i.e.  the  statistical  benavior  of 
the  error  subject  to  random  inputs.  If  the  input  u  is  zero  mean  white  noise  witn  unit  intensity,  then 
the  covariance  of  the  error  is  simply 

E(iy(t)'2)  *  -  ngii22  (2.14) 

This  performance  measure  is  the  integral  in  the  frequency  domain  of  Tr(g  g).  More  generally,  tne 
input  can  be  colored  noise  formed  by  shaping  white  noise  through  the  operator  K'1.  Also,  we  have  the 
freedom  to  examine  the  covariance  of  a  filtered  version  of  the  error,  say  z  *  Ly.  For  example,  sucn  a 
filter  operator  could  be  used  to  emphasize  a  particularly  crucial  band  in  the  frequency  domain  or  to 
deemphasize  low  or  high  frequencies.  Including  the  operators  L  and  R*'  yields 


(2.1b) 
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E('z(t)'2)  *  -jp  •itgr‘1ii22 

The  stable  operators  L  and  R*^  serve  as  weightings  in  the  frequency  domain  in  the  inteyral  (2. 1 b j . 

This  measure  of  performance  has  received  a  great  deal  of  attention  in  the  literature  on  Wiener  and 
Kalman  filtering  and  the  1 inear-quadratic-Gaussian  control  problem. 

The  second  type  of  performance  measure  is  “worst  case"  performance.  The  idea  in  this  case  is  to  let 
the  input  be  any  arbitrary  function  in  l_2  with  unit  norm.  Then  the  size  of  the  error  in  the  worst 
case  is 

sup  iiyii,  =  nG»,  =  »gii  (2.16) 

iiuii  2=1  2  2 

Good  performance  in  this  sense  requires  having  a  small  value  of  o(g(j<o))  at  every  frequency  w. 

The  utility  of  this  approach  to  measuring  performance  is  increased  by  introducing  weignting  operators  L 
and  R  mugh  as  was  done  for  stochastic  performance.  Witn  the  weightings,  we  have 

sup  "Lyn,  =  nigr"'ii  (2.  17) 

«Ruii  2=1  i 

This  measure  of  performance  can  be  written  in  a  slightly  different  form  to  empnasis  its  use  as  a 
performance  specification.  Mathematically, 

" Ly" 2  <  1  for  all  »Ruii2<  I 

44  iiLGR-1n 2  <  1  (2.18) 

44  mgr"1!^  <  1 

This  says  that  having  the  maximum  singular  value  of  the  (weignted)  transfer  matrix  less  than  one  at 
every  frequency  is  necessary  and  sufficient  for  the  (weighted)  error  to  have  norm  less  than  one  for  any 
(weighted)  input  with  norm  no  larger  than  one.  This  measure  has  been  the  subject  0f  recent  theoretical 
interest  within  the  control  community  [7],  [8].  Also,  when  viewed  as  condition  on  the  norm  of  LGR'1, 
this  measure  of  performance  is  applicable  to  nonlinear  systems. 

Both  of  these  performance  measures  are  expressed  in  terms  of  norms  on  weighted  transfer  matrices. 

Both  are  useful  in  the  analysis  of  linear  systems.  The  design  problem  in  the  first  case  is  to  minimize 
“average"  error  in  the  integral  square  sense.  In  the  second  case,  the  design  problem  is  a  mini-max 
problem:  minimize  the  worst  case  error  in  the  integral  square  sense.  The  remainder  of  this  paper  will 
use  the  second  measure  of  performance  as  expressed  in  (2.18).  This  will  allow  us  to  obtain  conditions 
on  simultaneous  performance  and  robustness.  As  an  aside,  we  note  that  the  performance  measure  in  (2.1b) 
can  also  be  interpreted  as  a  mini -max  problem  with  the  error  signal  measured  by  the  L_  norm. 

2.3  Sectors 

In  the  last  several  years,  the  sector  concept  has  been  recognized  as  an  important  tool  in  feeooack 
design  and  analysis  [.9],  [10],  [11].  The  basic  idea  is  that  very  complicated  plant  operators  (perhaps 
nonlinear,  infinite  dimensional,  time-varying)  can  be  reliably  approximated  by  simple  sector  centers 
(usually  finite  dimensional,  linear,  time-invariant  systems),  provided  that  the  approximation  error  is 
properly  accounted  for  in  the  design  process.  This  "proper  accounting"  usually  means  that  a  design 
based  on  the  sector  center  must  be  restricted  to  maintain  stability.  Such  restrictions  generally 
increase  as  the  magnitudes  of  the  approximation  errors  grow.  This  section  provides  an  introduction  to 
sectors  and  gives  a  stability  test  in  terms  of  a  sector  condition.  It  concludes  with  an  interpretation 
of  the  stability  test  as  imposing  restrictions  on  the  nominal  design. 


In  abstract  terms,  a  sector  condition  is  a  functional  inequality  describing  the  set  of  operators  in 
a  specified  neighborhood  of  some  nominal  operator.  Formally,  the  sector  (C.L.Rj  is  the  set  of  all 
operators  6  mapping  L2fi  into  L2e  which  satisfy 
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The  nominal  operator  C  is  referred  to  as  the  center  of  the  sector.  The  operators  L  and  K  specify  the 
size  of  the  neighborhood  about  C.  We  will  require  L  and  R  to  be  L^-stacle  operators  ana  to  nave 
L2e*stable  inverses.  When  L  and  R  are  linear,  time-invariant  operators,  this  means  they  have  no  poles 
or  zeros  in  the  right-half  of  the  complex  plane  and  no  excess  of  poles  or  zeros.  In  the  case  of  an 

1- 2e-stable  center,  condition  (2.19)  can  be  rewritten  as 

uL(G-C)K',h2  <  1  (2.2u) 

It  is  noteworthy  to  compare  this  condition  with  the  performance  specification  (2.18). 

It  is  helpful  to  introduce  two  alternate  representations  of  a  sector.  From  the  defining 
relationship  (2.19),  it  is  clear  that  the  sector  (C,t,R)  is  equivalent  to  the  parallel  combination  of 
the  center  C  and  the  sector  (0,L,R)  having  the  null  operator  as  its  center.  This  equivalence  is 
illustrated  in  Figure  2-1.  There  is  a  "multiplicative"  representation  of  a  sector  as  well  as  this 
"additive"  one.  The  sector  (C,L,R)  is  equivalent  to  the  cascade  of  the  center  c  witn  sector 
(I,L,RC*^)  having  the  identity  operator  as  its  center.  This  decomposition  is  also  shown  in  Figure 

2- 1.  Both  of  these  representations  will  be  useful  in  the  sequel. 

Conic  sectors  are  those  sectors  for  which  the  operator  L  is  simply  the  identity.  It  G  is  a  memoer 
of  the  conic  sector  or  cone  (C,R)  with  l_2e-stable  center,  then 

ii  G-C  h2  <  ii  R  ii  (2.21 ) 

and  so  the  output  of  C  approximates  the  output  of  G  to  within  approximation  error  bounded  by  the  norm 
of  R.  Conic  sectors  have  simple  graphical  interpretations  in  two  important  special  cases.  One  case  is 
when  G  is  a  scalar  nondynamical  (i.e.  memoryless)  nonlinearity  and  C  and  K  are  scalar  linear  gains.  Tne 
conic  sector  condition  (2.21)  is  equivalent  to 

i G(x) -Cxi  <  i Rxi  (2.22) 

This  inequality  has  the  simple  graphical  interpretation  that  the  nonlinearity  G(x)  has  its  graph  in 
the  conic  region  between  the  line  of  slope  (C-R)  and  the  line  of  slope  (C+R).  This  condition  is 
illustrated  in  Figure  2-2.  The  second  case  is  when  G,  C,  and  R  are  all  scalar  stable  linear 
time-invariant  operators.  For  this  case,  the  frequency  domain  can  be  used  to  express  the  conic  sector 
condition  (2.21)  as 

ig(jw)-c(ju)i  <  i r( jw) i  ¥  u  (2.23) 

This  condition  describes  the  conic  sector  as  the  set  of  all  frequency  responses  g ( j«*>)  whicn  are  within 
a  distance  ir(jw)i  of  the  nominal  transfer  function  c(jw).  Figure  2-3  provides  a  graphical 
interpretation  of  this  condition  in  terms  of  the  Nyquist  locus  of  5(j“)  and  c(jw).  A  circular 
"template"  centered  at  £(jw)  and  of  radius  r(jw)  in  the  Nyquist  plane  is  defined  at  each  frequency. 

The  value  of  g(>)  must  lie  within  each  of  these  circular  templates.  These  two  examples  snow  how 
conic  sectors  enable  us  to  work  with  sets  of  models  in  the  design  process.  This  is  an  important 
function  since  no  single  model  can  represent  a  physical  system  with  perfect  fidelity. 

2.4  Stability  Conditions 

Having  introduced  sectors,  we  now  turn  to  characterizing  the  stability  of  a  feeuDack  system  when  an 
operator  in  the  system  is  described  by  a  sector.  Consider  the  feedback  configuration  shown  in  Figure 
2-4.  The  two  closed-loop  operators  of  this  system  are  Ej  and  E2.  mapping  both  inputs  U|  ana  u2 
into  the  outputs  e^  and  e2>  respecively.  It  is  assumed  that  the  system  is  causal  and  well-posed, 

[6]  i.e.,  the  operators  M,D,Ej,  and  E2  are  all  causal.  The  system  is  stable  if  botn  Ej  and  E2 
are. 

Theorem  2-1 .  (Small  Gain  Teorem)  Under  these  conditions,  the  closed-loop  system  is  L2e-staole  if 

ii  M  ii  2  ii  a  ii  j  <  1 

A  brief  discussion  of  the  proof  of  this  theorem  follows.  The  error  ej  is  given  by 

e,  *  u,  -  ae2  (2.24) 

*  Uj  -  au2  -  AMe1 
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and  so 


(I+aM)  =  Uj  -  4  u2 


(2-26) 


Tne  norm  of  the  right  hand  side  of  (2.25)  is  bounded  from  above  by 


ii  Uj  ii  2  +  "  4  "2  11  u2  "2 


and  the  left  hand  side  is  bounded  from  below  by 


(1  -  11  a  11,  11  M  11,)  11  e,  n. 


Combining  these  bounds  and  using  the  inequality  of  the  theorem  yields 

"  V  <  WJiK-  ("V2+"A,2"U2"2> 

Thus  the  operator  E^  has  finite  gain  and  is  stable.  Stability  of  is  shown  similarly. 


(2.2b) 


The  Small  Gain  theorem  will  now  be  used  to  obtain  stability  robustness  conditions  for  a  feedback 
system  containing  a  sector.  Stability  robustness  conditions  will  be  obtained  which  guarantee  the 
stability  of  the  closed  loop  system  for  any  operator  that  is  an  element  of  the  sector.  Tne  feedoacx 
system  under  consideration  is  shown  in  Figure  2-5.  Also  shown  in  the  figure  are  two  alternative 
representations  of  the  feedback  loop.  The  first  alternate  uses  the  earlier  observation  that  a  sector 
may  be  expressed  or  the  parallel  combination  of  its  center  and  a  sector  witn  the  same  radius  centered 
about  the  null  operator.  The  second  alternative  representation  employs  the  closed  loop  operator  for  the 
feedback  combination  of  G  and  C.  It  is  assumed  that  the  closed  loop  operator  i*l  is  L2e-stable. 

The  Small  Gain  Theorem  will  be  applied  to  the  feedback  system  in  Figure  2-5  involving  operator  m  and 
sector  (O.L.R).  Let  the  operator  a  be  any  element  (O.L.R).  By  (2.20),  this  means  that 


11  LaR'  11 2  <  1 


(2.27) 


Rather  than  applying  the  Small  Gain  Theorem  directly  to  M  and  a,  we  apply  it  to  the  operators  KML 
and  LaR*1,  shown  in  Figure  2-6.  These  operators  are  both  L2e-staole,  and  closed  loop  staDility  of 
the  system  in  Figure  2-6  is  equivalent  to  stability  of  the  representations  in  Figure  2-5.  By  the  Small 
Gain  Theorem,  closed  loop  stability  is  guaranteed  if 


i*RML_1 11 2  it  LaR-1 11 2  <  1 


(2.28) 


When  combined  with  (2.27), 


11RML-  11 2  <  1 


suffices  to  ensure  closed  loop  stability.  This  is  suiwiarized  in  the  following  theorem: 


Theorem  2-2.  Consider  the  feedback  system  of  Figure  2-5  and  assume  (C+G-')"^  is  L2e*stable.  It 

iiR(C+G*1)‘1L"1»2  <  1 

then  the  closed  loop  system  is  stable  for  any  operator  in  the  sector  (C,L,R). 


Theorem  2-2  can  be  interpretted  in  classical  frequency  domain  terms  for  the  special  case  of  a  conic 
sector  (C,R).  Consider  the  closed  loop  operator 


CGd+CG)'1  *  C(C+G'1)*1  *  (H-(CG)'V1 


(2.30) 


Theorem  2.2  shows  that  stability  can  be  achieved  for  all  operators  in  the  conic  sector  (C,R)  if  the 
nominal  closed  loop  response  (2.30)  of  the  feedback  system  is  restricted  to  be  small  for  all  inputs 
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which  have  large  normalized  conic  sector  approximation  errors,  RC"\  For  linear,  time-invariant 
operators,  condition  (2.13)  can  be  used  to  express  this  in  the  frequency  domain.  Tne  inequality  of 
Theorem  2-2  is  satisfied  if 

hRC'm  < - - , — r -  (2.31) 

1  »C(C+G-  )"  ii 

or,  in  the  frequency  domain, 

<j[r(jw)c‘,(jw)]  <  o  [I+(c(j«)g(ja.))‘1]  ¥■  ui  (2.32) 

Note  that  (2.32)  imposes  explicit  magnitude  constraints  on  tne  nominal  closeu  loop  frequency  response. 

At  all  frequencies  where  the  normalized  error  r( juijc*' ( jw)  is  large  compared  with  unity,  the 
inverse  loop  transfer  matrix  (c( ju)g( jw) J'1  must  also  be  large.  Hence,  the  loop  transfer  matrix 
itself  must  be  small.  Since  r(ju)c~'(jw)  typically  grows  large  at  higner  frequencies,  this 
constraint  imposes  explicit  limitations  on  acnievable  crossover  frequencies  and  closed  loop  banowidtns. 

It  will  be  helpful  to  compare  the  stability  robustness  condition  of  Theorem  2.2  witn  the  performance 
specification  (2.18).  Stability  robustness  is  ensured  if  a  closed  loop  operator  when  "weiynted"  by  R 
and  L  1  has  norm  less  than  one.  Performance  is  achieved  if  some  closed  loop  error  operator  has 
“weighted"  norm  less  than  one.  The  same  type  of  condition  applies  in  both  cases.  Because  the  same 
condition  is  used  for  both  a  performance  specif ication  and  stability  robustness,  we  will  be  able  to 
obtain  a  combined  condition  which  can  guarantee  simultaneous  robust  performance  and  stability. 

3.0  CONIC  SECTORS  FOR  HYBRID  COMPENSATORS 

3.1  The  Hybrid  Compensator 

A  hybrid  compensator  consisting  of  both  analog  and  digital  elements  is  snown  in  Figure  3-1. 

Although  it  contains  a  digital  computer,  the  hybrid  compensator  transforms  an  analoy  input  signal  to  an 
analog  output  signal.  In  Section  3,  we  will  show  how  to  construct  useful  conic  sectors  whicn  contain 
this  operator.  The  hybrid  compensator  is  introduced  in  3.1.  A  fundamental  result  of  Thompson  [1J  is 
given  in  3.2  which  enables  us  to  place  a  staole  compensator  inside  a  cone.  In  3.3,  we  show  now  to  use 
this  result  for  unstable  compensators  and  how  to  reduce  the  radius  of  the  cone.  The  approach  is 
extended  to  multi-rate  hybrid  compensators  in  3.4. 

As  shown  in  Figure  3-1,  the  hybrid  compensator  consists  of  a  prefilter,  a  sampler,  a  digital 
computer,  and  a  hold  device.  The  prefilter  Fisa  linear,  time-invariant  operator  mapping  the  input  e 
into  a  filtered  signal  e^.  It  serves  to  reduce  the  high-frequency  content  of  the  signal  to  the 
sampler  and  thereby  reduces  the  problems  associated  with  aliasing.  The  sampler  converts  the  analog 
signal  ef  into  the  discrete  time  sequence  ed.  It  operates  with  a  sample  time  T  or  sampling 
frequency  «s  =  2»/T  and  is  considered  ideal,  i.e. 

ed(n)  «  ef(nT)  (3.1) 

for  any  integer  n.  The  operator  D  associated  with  the  digital  computer  is  a  linear,  shift-invariant 
mapping  of  ed  into  u^.  The  hold  device  transforms  the  discrete-time  sequence  ud  bacx  into  an 
analog  output  u. 

An  equivalent  analog  representation  of  the  hybrid  compensator  is  shown  in  Figure  3-2.  Here  the 
ideal  sampling  operation  is  represented  as  modulation  of  the  signal  e^(t)  oy  an  infinite  train  of 
impulses.  The  ‘-notation  is  used  to  deonte  the  analog  representation  of  the  corresponding  discrete-time 
signal.  It  is  well-Known  that  modulation  by  an  impulse  train  is  equivalent  to  convolution  by  an  impulse 
train  in  the  frequency -domain,  i.e. 

ed  =  —  ft  ef  (J“-Jwsn) 

The  analog  signals  ej(t)  and  u2(t)  are  related  to  their  discrete-time  counterparts  by 

e*( t.)  *  ft  ed(n)  «(t-nT)  (3.3) 


and 


UjU)  *  ft  ud(n)  J(t-nT) 


(3.4) 
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The  Fourier  Transform  of  e^  is  related  to  the  Z  transform  of  by 

e^O)  =  ed(z)  i  z=e>T  (3.5) 

The  digital  computer  is  now  represented  as  the  linear,  time-invariant  operator  0*.  It  is  easily 
verified  that  (3.4)  requires  0*  to  be  defined  as 

d*(>)  =  d ( z )  i  z=e>T  (3-6) 

The  hold  operator  is  simply  a  linear,  time-invariant  mapping  of  analog  signals,  by  abuse  of  notation, 
we  use  the  same  symbol  H  to  represent  the  hold  device  in  both  figures.  Tne  representation  of  the  hyDriu 
compensator  in  Figure  3.2  will  be  adopted  in  the  sequel- 

The  hybrid  compensator  will  be  denoted  by  the  linear  operator  K.  It  maps  e  into  u  according  to  tne 
time  varying  convolution 

u(t)  =  £  k(t,t)e(t)dr  (3.7) 

The  operator  K  is  time-varying  because  of  the  sampling  operation.  Tnus  it  cannot  be  represented  in  the 
frequency-domain  as  a  transfer  function.  It  is  possible,  however,  to  relate  the  Fourier  Transforms  of  e 
and  u,  viz. 

u(j“)  =  yh(jw)d  ( jw)tf (j«>-j“sn)e( jw-ja^n)  (3.8) 

This  equation  is  discussed  at  length  by  Franklin  and  Powell  [12].  It  comes  from  the  concatenation  of 
the  operations  in  Figure  3-2.  It  is  used  extensively  in  3.2  to  construct  a  conic  sector  containing  K. 

3.2  Construction  of  a  Conic  Sector  that  Contains  a  Hybrid  Compensator 

In  his  recent  PhD  dissertation,  Thompson  [1]  shows  that  a  staDle  hybria  operator  K  can  be  placeo 
inside  a  computable  conic  sector  (C,R).  This  allows  the  sector  concepts  discussed  2.3  to  oe  applied  to 
such  compensators.  We  now  review  and  discuss  Thompson's  result. 

Theorem  3.1  [1]  Consider  an  L2e-stable  hybrid  operator  K,  and  let  C.R.R"1  be  linear, 
time-invariant,  l_2e-stable  operators.  Under  these  conditions,  K  is  in  the  cone  (C,K)  if 

o  [r( ju» ) ]  >  [r^jwj+r^jw)]*^2  (3.9) 

where 

r,( jw)  =  i  r  ii  y  h(jw+j«  m)d  ( j«)f ( jw+j“$n)"2 
m  n=m 

•«  a  -*  “  2 

r2(j“)  =  t  "  y  h(j«j+j«  m)d  (j<*>)f (j“+j“sm)-c(jw+jo‘sm)ii2 
m 

This  significant  theorem  is  proved  in  Appendix  A.  It  says  that  any  hybrid  compensator  can  be  placed 
inside  some  appropriate  conic  sector.  The  utility  of  this  fact  depends  on  the  size  of  the  radius,  cf . 
Theorem  2-2.  Note  that  r^  and  r2  are  periodic  with  period  Given  a  center,  condition  ( 3- y ) 
imposes  a  magnitude  constraint  on  the  radius  R,  and  it  is  always  possible  to  choose  R  so  that  13.9)  is 
satisfied  with  equality. 

Altnough  any  stable,  linear,  time-invariant  operator  can  be  used  as  tne  center  in  tne  theorem,  only 
tnose  centers  that  result  in  a  "small"  radius  are  useful.  From  the  discussion  in  Z.i  on  conic  sectors, 
the  center  represents  an  approximation  to  the  hybria  operator.  One  very  useful  choice  for  tne  center  is 

c(ji*>)  =  |  n(j«)  d*(j«)f (jw)  (3.10) 

This  center  makes  r2(>)  =  0  and  so  is  referreu  to  as  the  optimal  center.  Since  r^  is  independent 
of  the  center,  the  choice  (3.10)  results  in  the  cone  with  the  smallest  radius.  An  alternate  choice  for 
the  center  is 

*  1  *  A 
c(jw)  *  y  h(>)da(jw)f ( jw) 

where  djjw)  represents  some  desired  analog  compensation. 

a 


(3.11) 
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It  is  noteworthy  that  given  a  center,  the  optimal  radius  is  computable.  Thompson  [1]  discusses  the 
convergence  properties  associated  with  the  defining  sums  for  r^  and  r^.  He  also  provides  analytic 
solutions  for  r-j  in  important  special  cases. 

The  conic  sector  representation  of  a  hybrid  operator  allows  the  application  of  the  staoility  tests 
of  Section  2.3.  Consider  a  plant  G  in  a  negative  feedback  configuration  with  hybrid  compensator  K. 
Theorem  2-2  can  be  applied  to  guarantee  stability  of  the  feedback  system.  Tne  theorem  assumes  that  the 
nominal  closed  loop  operator  (C+G"^)~^  is  stable.  Staoility  is  maintained  provided 

a  {r(ju)c'1(>)[l+(cl>>)g(ju))  ']*')  <  1  ■¥«>  (3.12) 

From  a  design  viewpoint,  the  analog  center  C  is  constructed  to  achieve  closeo-loop  staoility.  T»e 
digital  implementation  maintains  stability  if  the  normalized  error  RC~'  is  sufficiently  small. 

It  is  important  to  recognize  that  stability  of  the  nominal  analog  closeu-loop  operator  is  not 
equivalent  to  stability  of  the  nominal  discrete  time  closed-loop  operator.  This  is  illustrated  by  the 
example  systems  shown  in  Figure  3-3.  The  Nyquist  plots  for  these  two  systems  are  shown  in  Figure  3-4. 

The  discrete  time  system  is  stable,  while  the  continuous-time  system  is  unstable.  The  stabilizing 
effect  of  the  sampling  operation  can  be  attributed  to  the  fact  that  aliasing  of  the  plant  contributes 
phase  lead  near  crossover,  see  Figure  3-4.  Tou  [13]  discusses  how  a  pure  time  delay  can  have  a 
stabilizing  impact  on  a  discrete  time  system  because  of  favorable  phase  interactions  under  aliasing. 

This  phenomenon  depends  critically  on  the  phase  characteristics  of  the  plant  at  frequencies  above  the 
half  sample  frequency.  For  usual  aerospace  applications,  the  phase  of  the  plant  is  quite  uncertain  at 
such  frequencies.  Thus  we  feel  that  the  requirement  of  Theorem  3-1  for  stability  of  the  nominal  analog 
closed-loop  operator  is  not  overly  restrictive  for  aerospace  applications. 


The  simple  example  of  a  lead  compensator  is  used  to  illustrate  the  theorem. 
analog  compensator 

da ( j“ )  = 


Consider  a  desired 

(3.13) 


We  first  define  a  hybrid  compensator  which  implanents  this  lead  and  then  compute  the  center  and  radius 
of  a  cone  which  contains  it.  The  hybrid  compensator  is  (somewhat  arbitarily)  defined  as  follows: 


T  =  0.6283  sec. 


d(z)  =  7.66 


z-0.93yi 

7^0335 


The  prefilter  is  a  single  pole  at  0.5«>s.  The  hold  device  is  just  a  zero  order  hold.  Tne  digital 
compensation  was  computed  from  the  desired  lead  (3.13)  by  the  pole-zero  mapping  technique  [12j. 


The  optimal  center  CQ  for  this  example  is 

c0(»  =  y  h(jw)  d(ej“)  f(j-)  (3.14) 

For  comparison  purposes,  we  introduce  an  alternate  center  C  as 

a 

cd  s  — y—  hfl( jw)  dfl( ju)  f(ju>)  (3.  16) 

where  h,(j«)  is  the  first  order  approximation  to  tne  zero  order  hold, 

a 

The  center  C,  is  only  third  order,  yet  fairly  close  to  the  optimal  center,  figure  3-t>  shows  the  bode 

a 

plots  of  the  desired  analog  compensation  0.  and  the  two  centers  C.  and  C,.  Botn  centers  show 
appreciably  more  phase  lag  than  Da  above  (say)  0.3  rad/sec.  The  radius  R  ana  the  quantities  r1  ana 
r.  were  computed  according  to  Theorem  3-1  using  center  c_.  These  quantities  are  plotted  in  Figure 

c  a 

3-6.  Recall  that  r^  is  the  radius  associated  with  the  optimal  center  CQ.  Notice  that  there  is 
little  difference  between  r^  and  R  above  1  rad/sec. 
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The  normalized  error  RC*'  is  plotted  in  Figure  3-7.  Also  shown  in  the  figure  are  the 

a 

normalized  errors  when  0  is  computed  from  the  forward  rectangular  rule,  the  backward  rectangular  rule, 
and  Tustin's  rule  prewarped  about  0.5  rad/sec.  The  differences  do  not  appear  significant.  Notice  that 
in  each  case,  the  normalized  error  is  greater  than  one  at  every  frequency.  From  stability  robustness 
condition  (2.31),  it  is  not  possible  to  achieve  large  loop  transfer  gains  using  this  hybrid 
compensator.  Increasing  the  sample  rate,  as  shown  in  Figure  3-8,  does  lower  the  normalized  error  below 
one  over  a  significant  frequency  range. 

3.3  Techniques  to  Increase  the  Applicability  of  the  Conic  Sector 

Theorem  3.1  determines  a  conic  sector  which  contains  a  stable  hybrid  operator.  Two  shortcomings  of 
the  theorem  as  it  stands  are 

(i)  the  hybrid  operator  must  be  stable,  e.g.  hybrid  operators  witn  integral  action  cannot  be 
handled, 

(ii)  directionality  information  associated  with  multi  loop  compensators  is  lost  in  constructing  the 
radius,  i.e.  stability  tests  using  the  conic  sector  will  tend  to  be  conservative  in  cases  where 
the  digital  operator  D  has  a  large  spread  in  its  singular  values. 

This  section  shows  now  to  exploit  the  inherent  flexibility  of  Theorem  3.1  to  overcome  these  two 
shortcomings. 

The  first  topic  to  be  treated  is  obtaining  a  conic  sector  representation  that  is  useful  witn 
unstable  hybrid  compensators  and  has  a  “small"  radius.  Recall  the  analog  representation  of  the  nybrid 
compensator  shown  in  Figure  3-2.  F,  D*,  and  H  are  all  linear,  time-invariant  operators  --  only  the 
sampling  operation  is  time-varying.  Our  approach  is  to  obtain  a  conic  sector  that  includes  the  sampling 
but  excludes  the  other  components  of  the  hybrid  compensator. 

Consider  the  equivalent  analog  representation  of  the  hybrid  compensator  shown  in  Figure  J-9.  The 
operator  A  introduced  in  the  figure  is  linear,  time-invariant  with  no  poles  or  zeros  in  the 
right-half-plane.  Let  the  operator  S  denote  the  operator  inside  the  dashed  box  in  Figure  3-9,  i.e.  the 
concatenation  of  A,  modulation  by  the  impulse  train,  followed  by  A.  We  place  the  sampling  operator  S 
inside  a  conic  sector  according  to  Theorem  3-1  and  use  the  freedom  offered  by  a  to  reduce  the  normalized 
error  of  the  cone. 

Corollary  3.1  Let  R  and  R'^  be  linear,  time-invariant,  L2g-stable  operators. 

1  ? 

Then  S  is  in  the  cone  (y  A  ,  R)  if 
*  1  2  1/2 

o  [r(ju)]  >  [i  i  iiy  a(>+jw  m)a(ju+j«  n)u‘]  (3.17) 

m  n=m  1  s 

This  corollary  is  a  straightforward  application  of  Theorem  3.1.  it  handles  the  situation  where  D* 
is  unstable  by  excluding  the  digital  compensation  from  the  cone.  For  analysis  purposes,  the  prefilter, 
digital  compensator,  and  hold  (as  well  as  the  two  A*'  operators)  are  included  with  the  plant.  Let  G 
denote  the  plant.  Theorem  2-2  can  be  applied  if  the  nominal  closeo-loop  operator  is  staDle.  Tms 
operator  is  simply 

({  A2  +  [A‘1FGH0*A'1]'1)'1  *  TA‘2(I  ♦  [TFGHD*]'1 ) *’  (3.18) 

Stability  is  maintained  provided  .  ,  .  .  . 

II  K  (yA^  +  [A’  FGHD*A*  J*  )*  II2  <  1  13. ly) 

or 

U  RTA'2  (I  +  lTFGHO*]*1)'1  «2  <  1  (3.20) 

Notice  that  the  conic  sector  being  used  has  center  and  radius  that  are  essentially  independent  of  tne 
compensator  and  the  plant  being  controlled.  Only  the  sample  time  T  and  sampling  frequency  »s  come 

into  play  In  the  definition  of  the  cone.  The  operator  A  can  be  chocen  to  try  to  minimize  the  normalized 

_2 

error  KTA  .  More  precisely,  we  are  free  to  choose  A  to  achieve  (3.20)  as  long  as  stability  of  (3.18) 
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is  not  compromised.  Since  A  has  no  right-half-plane  zeros,  the  only  way  stability  of  couild  be 

.9 

compromised  is  if  A  rolls  off  too  fast  as  a  function  of  frequency  and  so  A  would  "roll  up"  too 
fast.  This  is  not  a  major  concern,  just  something  that  must  be  checked.  If  A  has  first  order  roll  off 

_o 

characteristics,  then  A  is  guaranteed  to  be  acceptable. 


Some  reasonable  choices  for  A  are  now  illustrated  by  example.  From  (3.20),  the  objective  is  to  make 
RTA'2  small.  The  examples  are  normalized  by  taking  =  1/sec  and  so  T  =  2*  sec.  Tne  first 
candidate  operator  is 


A  = 


S+e 


(3.21) 


As  e  approaches  zero,  it  can  be  shown  analytically  that  the  normalized  error  becomes 

r(j“)Ta  ^(ji«)  *  »  STTS  u 


(3.22) 

Notice  that  in  this  case  the  normalized  error  reaches  1  at  about  0.39  rad/sec  -  not  far  from  the  hair 
sample  frequency.  This  behavior  would  be  expected  to  be  quite  adequate  in  cases  where  the  closes  loop 
system  rolls  off  significantly  before  the  half  sample  frequency.  More  general  candidate  operators  are 

A  ■  (s4[7ITT)  t3-*3) 

and  again  we  examine  the  behavior  as  e  approacnes  zero.  No  general  analytic  results  are  available  in 
this  case,  but  Figure  3-10  shows  the  normalized  error  for  r  =  0,1,10.  The  t  =  u  case  is  just  the 
first  candidate  (3.21).  These  three  curves  show  a  tradeoff  between  high  and  low  frequency  benavior.  As 
t  increases,  the  normalized  error  decreases  for  low-frequencies  at  the  expense  of  increases  at  high 
frequencies.  These  choices  for  A  appear  to  be  useful,  but  their  relative  utility  would  depend  on  the 
particular  application,  cf.  (3.20). 


An  example  shows  that  the  choice  reflected  in  (3.21)  cannot  be  uniformly  improved.  Let  FGHU*  be 
defined  as 

100»s+100 

- - - 

100»as‘+(w+100)as+o-100 

The  closed  loop  system  is  stable  for  o=2.4  but  unstable  for  a=2.3.  Figure  3-11  is  a  grapnical 
version  of  the  stability  test  (3.20).  The  plot  shows  '(1  +  (TFGHD*)”1  )_1 1  and  »/2/3<i  for 
o=2.3.  This  shows  that  condition  (3.20)  is  violated,  and  thus  stability  cannot  be  guaranteed.  Since 
the  closed  loop  system  is  in  fact  unstable,  there  is  little  conservatism  in  the  test.  Hence,  the 
suggested  choice  of  A  results  in  a  normalized  error  that  cannot  be  uniformly  improved. 


The  above  Corollary  3-1  applies  to  single  or  multiple  loop  hybrid  compensators.  The  approach  of 
placing  just  the  sampling  operation  inside  a  conic  sector  does  preserve  the  directionality  information 
of  the  digital  compensation.  There  is  additional  freedom  in  the  multi -loop  case,  however,  to  enhance 
the  stability  test  (3.20). 


Figure  3-12  is  an  equivalent  representation  of  the  dashed  box  in  Figure  3-9.  Tne  operators  Q  and 
Q”1  are  linear,  time-invariant  and  L2g-stable.  Also,  the  transfer  matrices  of  Q  and  Q*1  are 
periodic  with  period  ws.  We  intend  to  place  the  operator  shown  in  the  dashed  box  of  Figure  3-13 
inside  a  conic  sector.  Suppose  A  is  restricted  to  be  a  scalar  times  the  identity.  Tnen  Q  commutes  witn 
A  and  the  modulation  by  the  impulse  train.  This  means  that  I)  and  Q'1  inside  the  dashed  box  cancel 
each  other,  and  so  the  conic  sector  of  Corollary  3.1  still  applies. 

The  net  result  of  Introducing  Q  and  Q"^  is  that  stability  condition  (3.2U)  becomes 

«  Q  {RTA'2(I+[TFGHU*J',)‘1}  q'1  h  <  I  (3.24) 

Thus  the  operator  in  (3.20)  is  preceded  by  ()"'  and  followed  by  Q.  One  interpretation  is  that  Q 
introduces  a  particular  weighting  on  the  original  cone.  The  additional  freedom  offered  by  0  expands  the 
class  of  plants  that  can  be  guaranteed  stable  using  conic  sectors.  Tne  question  of  how  to  choose  the 
right  Q  will  be  discussed  in  Section  5.  For  now,  we  simply  note  that  the  existence  of  a  single  o  so 
that  (3.24)  1$  satisfied  is  enough  to  guarantee  stability. 


4.  \ 
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3.4  Multi-Rate  Hybrid  Compensators 

It  is  straightforward  to  extend  the  conic  sector  approach  for  hybrid  compensators  to  tne  general 
case  of  multiple  independent  hybrid  compensators  having  arbitrary  sample  rates.  Eacn  individual 
compensator  can  be  placed  in  a  cone,  and  then  a  composite  cone  can  be  obtained  for  the  overall 
multi-rate  compensator.  This  composite  cone  is  suitable  to  use  with  Theorem  2-2  to  assure  stability. 

A  general  multi-rate  hybrid  compensator  employing  k  independent  sample  rates  is  shown  in  Figure 
3-13.  Following  the  development  of  3.3,  we  place  each  sampler  inside  a  conic  sector.  Thus  the 
operators  A,.  A2  ....  Aj^  are  introduced  and  the  radii  ,  R2  •••.  Rk  are  given  according  to 
Corollary  3.1.  Also,  the  weighting  operators  Q1 ,  Q2,  ....  Qk  having  the  appropriate  periodicity 
are  introduced.  For  notational  simplicity,  we  define  the  following  block  diagonal  operators: 

Fm  s  diag  (F^,  F2,  *..,  F^} 

D*  =  diag  (D*.  D2,  ....  0*1 

Hm  «  diag  (H^,  H2,  ....  H^} 

TM  =  diag  {T1  II*  T2  X2*  Tk  V 

A^  -  diag  {A^,  A2,  ...,  A^I 

0M  ~  diag  (Q^,  Q2>  ...,  Q^l 

Rm  =  diag  (R^,  R2,  ....  R^I 

where  Ij,  I2,  ...,  Ik  are  identity  operators  of  appropriate  dimension. 

Given  these  preliminaries.  Theorem  2-2  can  be  used  to  guarantee  staDility  of  the  closeu-loop  system 
in  Figure  3-13.  The  nominal  closed-loop  operator  is 

Vm2  +  WW1)'1  ^Z5) 

If  this  operator  is  stable,  then  stability  will  be  maintained  witn  the  multi-rate  nybrid  compensator 
provided 


Wm2  “  +  WW'V1  "2 


(3.2b) 


Equations  (3.25)  and  (3.26)  are  direct  analogs  of  (3.18)  and  (3.24)  in  the  case  of  a  single  hybrid 
compensator.  The  block  elements  of  are  periodic,  but  they  are  otnerwise  free  and  can  be  chosen  to 
minimize  the  operator  norm  in  (3.26).  We  postpone  the  discussion  of  the  appropriate  choice  of  these 
elements  of  QM  until  Section  5. 

Summarizing  this  section,  conic  sectors  have  been  found  for  single  and  multiple  sample  rate  hybrid 
operators.  This  permits  a  hybrid  compensator  to  be  analyzed  as  an  analog  compensator  with  the 
resulting,  quantified  approximation  error.  Provided  the  analog  compensator  gives  closed-loop  stability, 
conditions  (3.24)  and  (3.26)  can  be  used  to  guarantee  stability  of  the  closed-loop  system  witn  tne 
hybrid  compensator.  The  real  significance  of  using  conic  sectors  for  hybrid  compensators  will  be  shown 
in  Section  5.  There  we  will  obtain  conditions  which  guarantee  stability  and  performance  for  a  uncertain 
plant  controlled  by  a  hybrid  compensator. 

4.0  FEEDBACK  ANALYSIS  AS  A  BLOCK -DIAGONAL  BOUNDED  PERTURBATION  PROBLEM 


4.1  Robustness  Characterization 

This  section  formulates  the  basic  feedbacx  proolem  of  achieving  performance  in  tne  face  of 
uncertainties  as  a  stability  problem  in  tne  presence  of  block-diagonal  bounded  perturbations  L4).  The 
formulation  involves  sector-bounded  transfer  functions  as  basic  building  blocks.  Tne  robustness  and 
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performance  properties  of  a  feedback  system  will  be  expressed  in  terms  of  a  collection  of  linear, 
time-invariant  operators,  a^  ,  i*l,2,...,m,  each  of  which  is  an  element  of  a  sector  with  zero 
center,  i.e.  a-c(0,Lj,R^).  These  operators  are  the  basic  builoing  blocxs  in  a  comoined 
robustness/performance  characterization  of  feedbacx  systems. 

The  use  of  sector  bounded,  linear,  time-invariant  operators  to  characterize  robustness  has  been  a 
central  theme  in  many  recent  references,  including  [14]  where  such  operators  were  inserted  at  tne  inputs 
or  outputs  of  a  plant  model  in  order  to  represent  so  called  unstructured  uncertainties  (modeling  errors 
with  no  assumed  structure  except  for  known  magnitude  bounds  on  their  transfer  functions).  Necessary  and 
sufficient  conditions  such  as  Theorem  2-2  were  then  derived  for  stability  robustness  in  the  face  of  such 
uncertainties.  For  example,  a  staole  feedback  loop  with  plant  G  and  compensator  K  will  remain  staDle  in 
the  face  of  all  possible  perturbed  plants  G‘  =  [I  +  a]G,  with  a  e(0,L,R),  if  and  only  if 

J[RGK(I+Gk)',L*1]  <  1  M-  u  (4.1) 

Note  that  with  R  and  L  specified,  this  inequality  imposes  conditions  on  the  shape  of  the  closed  loop 
frequency  response,  GK(I  +  GK)*1,  which  must  be  satisfied  in  order  to  assure  robust  staoility.  Tnese 
conditions  are  unique  to  the  assumed  form  of  plant  perturbations  (e.g.,  G*  =  ( 1  +  a)G 
in  the  present  case).  Each  such  assumed  form  corresponds  to  a  specific  location  where  a  is  inserted 

in  the  nominal  feedback  loop.  The  location  for  our  present  case  is  shown  in  Row  1  of  Table  4.1.  Other 

locations  correspond  to  other  assumed  forms  for  G1  and  produce  different  necessary  and  sufficient 
stability  robustness  conditions.  A  representative  set  of  possibilities  is  summarized  in  the  remaining 
rows  of  Table  4.1.  (Most  of  these  cases  can  be  found  in  [lbj). 

Table  4.1  also  indicates  representative  types  of  physical  uncertainties  which  can  be  usefully 
represented  by  sector  bounded  perturbations  inserted  at  the  indicated  locations.  For  example,  the 
representation  G'  =  (I  +  a)G  in  Row  I  is  useful  for  output  errors  at  high  frequencies,  covering  such 
things  as  unroodelled  high  frequency  dynamics  of  sensors  or  plant,  including  diffusion  processes, 
transport  lags,  electro-mechanical  resonances,  etc.  The  representation  G'  *  G(1  +  a)  in  Row  2  covers 
similar  types  of  errors  occurring  at  the  Inputs.  Botn  cases  should  be  contrasted  with  Rows  4  and  5 

which  treat  G'  =  (I  +  a)_1G  and  G'  *  G(1  +  a)*1.  These  representations  are  more  useful  for 

variations  in  modelled  dynamics,  such  as  low  frequency  errors  produced  by  parameter  variations  witn 
operating  conditions,  with  aging,  or  across  production  copies  of  the  same  plant.  Discussion  of  still 
other  cases  is  left  to  the  table.  Note  from  the  table  that  the  stability  requirements  on  a  do  not 
limit  our  ability  to  represent  variations  in  either  the  number  or  location  of  rhp  singularities. 

The  most  significant  thing  to  understand  about  Table  4.1  is  that  the  stability  robustness  conditions 
shown  are  sufficient  to  assure  stability  only  if  all  the  uncertainties  occur  at  the  indicated  locations 
and  none  occur  elsewhere.  In  order  to  use  the  conditions  directly,  therefore,  designers  are  obliged  to 
reflect  all  known  sources  of  uncertainty  from  their  known  point  of  occurrence  to  a  single  reference 
location  in  the  loop.  Such  reflected  uncertainties  invariably  have  a  great  deal  of  structure  which  must 
then  be  “covered  up”  with  a  larger,  arbitrarily  more  conservative  perturuation  in  order  to  maintain  a 
simple  cone  bounded  representation  at  the  reference  location.* 

Alternatively,  designers  could  choose  to  treat  uncertainties  occurring  at  several  different 
locations  in  the  feedback  loop  as  a  single  uncertainty  occurring  at  one  location  in  a  larger  feedback 
loop.  To  be  specific  about  this  alternative,  let  a.,  1*1,  2,  ....  m,  denote  a  collection  of  sucn 
uncertainties  positioned  at  location  1*1,  2,  ....  m.  Note  that  at  each  t^,  the  feedback 
loop  has  an  input,  where  it  receives  the  signals  from  a^,  and  also  an  output,  where  it  supplies 
signals  to  a^.  Let  M(<  be  the  operator  between  these  two  sets  of  signals.  Further,  let  M.^ 
denote  the  operator  between  the  inputs  at  location  tj  and  the  outputs  at  location  i^.  Then  the 
block-  structured  operator 

M  *  {  M1j  1  (4.2) 

*  By  “arbitrari ly  more  conservative,"  we  mean  that  examples  can  be  constructed  where  the  degree  of 
conservatism  is  arbitrarily  large.  Of  course,  other  examples  exist  where  it  is  quite  reasonable. 
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represents  all  Interactions  of  the  feedback  loop  with  its  uncertainties,  and  indeeo,  the  blocx-diagonal 
bounded  perturbation  diagram  in  Figure  4,1  is  an  equivalent  representation  of  the  loop.  Here  we  have 
a  *  diag(Ar  ....  nj. 

Note  that  the  feedback  elements  in  this  larger  loop  are  zero  in  the  absence  of  uncertainties. 

Hence,  M  will  be  a  stable  “plant"  wnenever  the  original  nominal  loop  is  stable.  As  an  example  of  this 
representation,  consider  the  system  in  Figure  4.2.  Tnis  system,  witn  two  uncertainties  present 
simultaneously,  the  first  from  Row  2  and  the  second  from  Row  4  of  Table  4.1,  is  describea  by  the 
following  M  operator: 

(I  +  K6)''kG  (1  +  KG)' V 

”  -  (4.3) 

(I  +  GK)  G  (1  +  GK) 

Given  the  equivalent  system  in  Figure  4.1  with  sector  bounded  it  follows  from  the  Small  Gain 
Theorem  that  the  loop  remains  stable  in  the  presence  of  these  uncertainties  if 

mRML'1h2  <  1 

or,  in  the  frequency  domain, 

a  [R(ju)M(j(ii)L(ju)"1]  <  1  IF  u  >  0  (4.4) 

where  R  =  diag(R1  R?  ...  Rm)  and  L  =  diag(L1  |_2  ...  1^).  This  condition  provides  an 
alternate  test  for  stability  robustness.  Like  the  procedure  of  reflecting  all  uncertainties  to  one 
reference  location,  however,  the  new  test  can  be  arbitrarily  more  conservative  because  it  ignores  the 
known  block-diagonal  structure  of  the  uncertainties  in  Figure  4.1. 

One  of  the  objectives  of  the  results  in  this  paper  is  precisely  to  reduce  the  conservatism  of 
robustness  and  performance  tests  for  block  diagonal  structures  sucn  as  Figure  4.1.  He  do  tnis  by 
introducing  a  generalized  notion  of  the  maximum  singular  value  for  block-diagonal  structures.  This 
generalization  is  developed  in  Section  5.  It  is  called  the  structured  singular  value  (S3V|  ana  is 
denoted  by  the  symbol  u.  It  yields  the  following  necessary  and  sufficient  conditions  for  robust 
stability  of  the  BDBP  problem: 

u[R( ju)M( ju)L  (ju)]  <1  ¥■  in  (4.5) 

This  represents  our  extension  of  the  Small  Gain  Theorem  which  we  call  the  Small  u  Theorem. 

Since  all  simultaneous  uncertainties  can  be  put  into  block-diagonal  form  by  merely  constructing  the 
associated  operator  M,  the  SSV  allows  us  to  nonconservatively  analyze  simultaneous  occurences  of 
uncertainties  anywhere  in  a  feedback  system.  The  uncertainties  may  be  sector  bounded  errors  of 
individual  components  of  the  system  (SISO  or  MIMO),  they  may  be  individual  parameter  variations  in  the 
model,  or  even  polynomial  approximations  of  parameters  entering  nonlinearly.  In  fact,  the  only 
restrictions  which  remain  is  that  all  variations  must  be  allowed  to  be  complex.  Pure  real  variations  or 
pure  imaginary  variations  cannot  be  separated  into  individual  blocks. 

4.2  Performance  Characterization 

The  ability  to  treat  simultaneous,  structured  uncertainties  also  offers,  almost  as  a  free  byproduct, 
the  ability  to  deal  simultaneously  with  the  performance  and  robustness  aspects  of  feedoack.  As 
discussed  in  2.2  and  2.4,  the  test  for  satisfaction  of  a  performance  specification  is  identical  to  a 
test  for  robustness  with  respect  to  some  uncertainty.  That  is,  any  performance  specification  has  a 
corresponding  robustness  requirement  such  that  one  is  satisfied  if  and  only  if  tne  otner  is.  Thus  the 
SSV  tests  for  robustness  can  be  used  directly  to  evaluate  performance. 

The  equivalence  between  performance  and  robustness  is  elaborated  in  Column  4  of  Table  4.1,  where 
each  of  the  conditions  imposed  on  feedback  loop  shapes  by  perturbation  a(  at  location  is 
given  a  performance  interpretation.  For  example,  the  perturbations  in  Row  4  impose  requirements 
(through  L  and  R)  on  the  operator  (1  +  KG)'1.  This  operator  is,  of  course,  the  classical  (output) 
sensitivity  function  of  the  feedback  loop.  Small  values  over  some  frequency  range  guarantee  low  closed 
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loop  sensitivity  to  open  loop  variations  and  low  command  following  errors  to  output  commands  over  that 
range.  A  particular  specification  on  these  performance  parameters  can  thus  be  thought  of  as  imposed  on 
a  design  by  introducing  a  “fictitious  uncertainty"  at  tne  location  in  Row  4  witn  sector  bounus  R  anu  L 
selected  to  meet  the  performance  requirement. 

To  illustrate  how  such  fictitious  uncertainties  actually  enforce  performance  specs,  consiuer  the 
simple  case  where  a  single  true  uncertainty,  say  ar  from  Row  2,  and  a  single  fictitious 
(performance)  uncertainty,  say  Ap  from  Row  4,  are  specifiea  for  our  feedback  system.  Let  the 
structured  singular  value  condition  (4.5)  be  satisfieo  for  the  corresponding  M  matrix  (equation  4.3). 
Then  the  system  remains  stable  in  the  face  of  ar  and  4p  occurring  simultaneously.  Obviously,  it 
will  also  remain  stable  for  ap  with  ar  =  0.  This  means  that  the  nominal  system  must  satisfy  the 
performance  condition 

o'[Rp(I  +  KGf’L*1]  <  1  (4.6) 

because  the  latter  is  also  a  necessary  and  sufficient  condition  for  robust  stability  witn  ap  only. 

This  much  is  straightforward.  What  is  not  so  evident  but  much  more  important  is  that  Condition  (4.6)  is 
also  satisfied  for  all  perturbed  feedback  loops.  That  is,  for  all  true  plants  G'  •  G(I  +  ar)  we  have 

o[Rp( I  +  KG’rVp*1]  <1  K.  (4.7) 

Hence,  the  performance  spec  is  satisfied  in  tne  face  of  all  possible  true  uncertainties.  A  proof  of 
this  consequence  of  the  structured  singular  value  condition  is  left  to  Section  5. 

5.  STRUCTURED  SINGULAR  VALUE  ANALYSIS  OF  FEEDBACK  SYSTEMS 


5.1  Introduction  to  the  Structured  Singular  Value 

We  have  discussed  how  the  problem  of  analyzing  performance  in  the  face  of  structured  uncertainty  can 
be  expressed  as  a  BDBP  problem.  The  standard  singular  value  tests  applied  to  the  BDBP  can  be 
excessively  conservative  because  they  ignore  the  block  diagonal  structure.  A  more  general 
non-conservative  test  (the  Small  u  Theorem)  is  developed  in  this  section  which  removes  this 
limitation.  By  non-conservative  we  mean  providing  a  necessary  and  sufficient  conuition.  Tne  test  is 
expressed  in  terms  of  a  new  measure,  the  structured  singular  value  u.  This  section  begins  with  review 
of  the  results  in  [5]  where  u  was  introduced. 


To  provide  a  more  precise  description  of  block  diagonal  perturbations,  let  K  =  (m1 ,m2, . . ■ ,mn, 
kltk2,..,kn)  be  a  2n-tuple  of  positive  integers.  All  the  definitions  that  follow  depend  on  K,  but 
to  simplify  notation  this  dependency  will  not  be  explicitly  represented.  Let 

n  n 

k  *  l  nijk,  and  m  =  I  m,. 

j-l  J  J  j»l  J 

Let  X  be  a  set  of  kxk,  rational,  block-diaginal  matrices  defined  by 

ml  ffl2  > 

r  — ,  r  - ■>  t  ' 

X  ■  {diag(A1.A1.—  ,i1.  a2,a2,-.,a2,  a3,— , 

i  for  each  j»l,  2,  ••*,  n,  is  a  k ^xk ^  matrix) 


Let  U  be  the  set  of  block  diagonal  unitary  matrices,  and  O’ the  set  of  real  diagonal  matrices  sucn  that 
H  -  {  diag(d1Ik,  d2Ik,  ...  d^  1^,  dm)+  ,  1^ . dm  IkJ  |  d  c.R  .+(0,  -))  (5.2 

What  is  desired  is  a  function  (depending  on  K) 

V-lnW  *  [0,  •)  (5.3 

with  the  property  that  VM 

det(I+Ma)  4  0  ■¥■  a*X,  7(a)<« 


iff  <u(M)<1 


(5.4) 
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This  could  be  taken  as  a  definition  of  u.  Alternatively,  u  could  be  defined  as 

0  if  no  AcX  solves  det(I+Ma)  =  0 

u(M)  =  (5.5) 

min  Io(a) idet( I+Ma)  *  0}  *'  otherwise 
AeX 

This  definition  shows  that  a  well-defined  function  satisfies  (5.4).  It  probaoly  has  little  aduuional 
value  since  the  optimization  problem  involved  does  not  appear  to  have  useful  properties. 

Using  these  definitions,  the  following  useful  properties  of  u  are  easily  proven. 


1)  w  (oM)  =  lain  (M)  -¥-Me2rt.(k) 

2)  u( I )  «  1 

3)  m(AB)  <  o(A)m(B)  -V-  A.oe^k) 

4)  u(&)  =  o(a)  -V-  AcX 

5)  If  n=l  and  m^=l  then  v(M)  =  o(M)  -y-Me^K) 

6)  If  n=l,  k^l,  then  k^, 

X  =  IXIiXcR  and  n (M)  =  p(M)  -¥•  M s TT^V) 


7)  If  AcX,  U^L  then  UAcX  and  AUcX  and  ofuAj^AuJ^A) 

8)  ¥  AcX  and  -V  Dc#  0a0_1=a 

9)  V-UcU  and  M  v(MU)  =  u(UM)  =  v(M) 

10)  *  Dc  and  M  vCDMD'1)  =  w(M) 

11)  max  p(UM)  <  w(M)  <  inf  a  (DMD*1)  V- 

UcU  ”  Dc)& 

Properties  5)  and  6)  show  that  the  structured  singular  value  has  as  special  cases  botn  the  spectral 
radius  and  the  maximum  singula*-  value.  Property  9)  means  that  u  is ‘U.-invariant, 

The  most  important  results  from  [5]  are  the  following,  whicn  deal  with  the  bounds  in  property  Hi: 

a)  The  left-hand-side  inequality  in  11)  is  always  an  equality.  This  expresses  u  in  familiar 

linear  algebraic  terms,  but  the  optimization  problem  involved  may  have  multiple  local  maxima. 


b)  The  right-hand-side  inequality  in  11)  is  an  equality  when  there  are  three  or  fewer  blocks,  and 
the  blocks  are  not  repeated.  The  blocks  themselves,  and  therefore  M,  may  be  of  arDitrarily 
large  dimension.  A  tedious  but  straightforward  computation  shows  that  the  optimization  problem 
involved  is  always  convex  [16].  Furthermore,  the  minimization  is  over  only  n-1  parameters  for 
n  blocks,  independent  of  block  size,  making  this  an  attractive  alternative  to  a). 

Note  that  the  transformation  OMD  '  is  simply  a  rescaling  of  the  inputs  and  outputs  of  M.  The  SSV 
is  invariant  with  respect  to  such  rescaling  (property  10),  while  singular  values  do,  of  course,  vary 
with  rescaling.  This  implies,  for  example,  that  the  ad  hoc  method  of  performing  a  change  of  units  can 
reduce  the  conservatism  associated  with  singular  values.  For  some  time  we  have  been  using  Osborne's 
technique  [17],  which  minimizes  the  Frobenius  norm  of  DMD"1  to  compute  frequency-dependent  D 
matrices.  We  now  have  new  algorithms  which  compute  D  to  directly  minimize  o(DMD"' ) . 


While  the  strongest  results  on  the  use  of  the  DMO”'  scaling  are  for  nonreoeated  blocks,  the  use  of 
scaling  before  computing  singular  values  can  be  quite  effective  in  treating  repeated  bloc*  problems. 
These  typically  arise  when  analyzing  robustness  with  respect  to  variations  in  scalar  parameters 
occurring  in  several  places  within  a  system.  Tnis  leads  naturally  to  problems  where  the  blocx-diayonal 
perturbations  are  made  of  either  nonrepeated  matrix  blocks  or  repeated  scalars.  In  this  case  the  set  x 
takes  the  special  form: 


Suppose  now  we  let 


X  *  Idiag  (x,  I,  x2I,  •••  xh  I,  Aj,  a?,  •••  a^) 
D  *  (diag  (D,,  02,  •••  D^,  d,I,  d2I,  ••  <^1)1 


where  the  structure  of  0  matches  X  and  the  are  full  block  matrices  and  the  d^  are  real  scalars. 
Then  an  upper  bound  for  u  with  this  structure  is  again 


The  use  of  full  block  D.'s  improves  the  bound  and  a  simple  argument  shows  that  the  problem  is  still 
convex.  Of  course,  the  scaling  could  be  extended  to  handle  repeated  nonscalar  blocxs,  but  these  have 
not  yet  been  bound  to  be  applicable  to  system  problems. 

Numerical  software  for  computing  g  has  been  developed  using  algoritnms  based  on  tnese  results.  In 
addition  to  using  this  software  to  analyze  some  simple  feedback  designs,  test  runs  have  been  made  on  a 
large  number  of  psuedo-random  matrices.  It  appears  that  the  global  maximum  in  a)  is  often  easily  found, 
although  a  simple  gradient  search  is  inadequate.  Also,  the  bound  obtained  in  b)  appears  to  be  quite 
good  (to  within  15*)  for  cases  of  more  than  3  blocks.  These  observations  are  most  encouraging, 
especially  considering  the  experimental  and  preliminary  nature  of  the  software. 

There  are  essentially  two  direct  applications  of  singular  values  to  the  BDBP  problem,  which  provide 
bounds  for  u: 

1)  Ignore  the  block  diagonal  structure  and  compute  7(M).  Tnis  gives  an  upper  bounu  for  g. 

2)  Treat  each  perturbation  one  at  a  time.  Compute  the  largest  maximum  singular  value  for  each  of 
the  corresponding  diagonal  blocks.  This  gives  a  lower  bound  for  g. 

The  gap  between  these  two  bounds  may  be  arbitrarily  large. 

An  extension  to  1)  was  proposed  by  Lehtomaki  ([15], [18]),  who  uses  the  singular  vectors  for  o(M) 
to  sharpen  the  bound.  Lehtomaki's  method  checks  for  structure  but  not  in  the  BDBP  form.  Tne  optimism 
of  2)  can  be  reduced  by  using  a  method  suggested  by  Freudenberg,  et  al  [19],  who  evaluate  the 
differential  sensitivity  of  the  singular  values  at  one  point  witn  respect  to  perturDations  at  anotner. 
Although  this  method  does  not  apply  to  simultaneous,  large  perturbations,  it  can  be  quite  useful  in 
indicating  when  the  lower  bound  for  g  obtained  by  method  2)  is  optimistic.  It  should  be  mentioned 
that  Lehtomaki  and  Freudenberg  did  not  presenc  their  techniques  in  the  context  of  the  BDBP  prooiem. 

5.2  Robustness  Analysis:  The  Small  g  Theorem 

The  preceding  discussion  of  u  and  the  BDBP  problem  has  dealt  witn  determining  the  size  of  tne 
minimum  structured  perturbation  &  that  causes  I  +  Ma  to  be  nonsingular.  We  are  interested  in  using 
the  structured  singular  value  to  answer  robustness,  sensitivicy,  and  performance  questions  for 
multivariable  feedback  systems.  The  connection  between  u  and  these  essential  feedback  properties  is 
provided  by  the  Small  g  Theorem,  which  characterizes  the  stability  robustness  properties  of  a  feedbacx 
system  with  respect  to  block  diagonal  perturbations.  In  order  to  state  the  Small  g  Theorem  we  need 
the  following  additional  definitions  depending  on  K: 

Let  L,  R  c  X  be  such  that  L  and  R  have  no  poles  or  zeros  in  the  open  right-half-plane.  Then  let 

*  (L’^eR  i  8eX  and  o(e(s))sl  V- Re(s)>  o)  (5.7) 

For  the  BDBP  problem  in  Figure  4.1,  1  is  the  set  of  allowable  blocx  diagonal  perturbations,  and  L  and  R 
are  the  weightings  for  the  a  such  that  7(LaR_1)  <_  1.  We  will  say  the  canonical  system  in 
Figure  4.1  is  stable  iff  I+Ma  is  nonsingular  in  the  closed  right-nalf-plane.  Altnough  this  definition 
does  not  distinguish  between  ill-posedness  and  instability,  it  is  adequate  for  our  purposes.  We  can  now 
state  and  prove  the  following: 

Theorem  5.1  (Small  g):  The  canonical  system  is  closed  loop  stable  for  all  at  *X  iff 

g.  *  sup  g(RML-1 )  <  1  (s.8) 

tl) 

Proof:  To  prove  the  if  part,  suppose  vc  <  1  and  let  a  t  X.  Tnen  usiny  properties  3)  and  11) 
and  the  definition  of  X 


6-18 


sup  p(MA)  =  sup  p(Ma)  <  sup  p(RML_1)  =  u  <  1. 

Res>0  s=ju  s=ju  c 

Thus  I  +  MA  is  nonsingular  for  all  Res  >  0.  Since  a  was  arbitrary,  the  canonical  system  is 
stable  for  all  a  e  X. 

Conversely,  suppose  u ( RML )  |  (ugmay  be  »).  Then  3  he'X  such  that  o(e)  <1  and 

det(l  +  RML^e)  _u  =  0.  Thus,  3  a  p“X,  a  det(I  +  Ma)|  _  =  0  and  the  canonical  system  is  not 

W-  Q  “WQ 

stable  for  all  a  cy  .  o 

This  theory  guarantees  that  if  u(RML“')  is  less  than  1  at  every  frequency,  then  the  closed-loop 
system  is  stable  for  all  structured  perturbations  Ae ‘X.  Conversely,  if  u(RML"')  is  greater  than 
or  equal  to  1  at  some  frequency,  then  there  exists  a  structured  perturbation  aeT(  that  results  in 
closed-loop  instability.  Note  that  a  destabilizing  a  can  be  expressed  as  LeR*^  for  some  constant  e. 

5.3  Performance  Implications 

As  noted  in  Section  4,  the  Small  u  Tneorem  can  also  guarantee  a  pre-specif ied  performance  level  by 
including  a  performance  block  in  the  B08P  problem,  furthermore,  this  performance  level  is  guaranteeo 
for  all  structured  perturbations  AeX.  These  claims  are  made  precise  by  a  corollary  to  the  Small  p 
Theorem  that  treats  performance.  Suppose  that  the  plant  uncertainties  are  given  by 

Ap  =  diag(Ar  a2,  ....  aJ  c*lr 

with  corresponding  weighting  matrices  Lr,  Rr,  interconnection  matrix  Mr,  and  2n-tuple  Kr  = 

(nip  ....  mn,  kp  ...,  kn).  Suppose  that  a  performance  specification  is  given  as 

“oyWV1* <  '  (5-9) 

Here  Mp  is  a  kpxkp  performance  matrix  which  we  desire  to  be  small  (as  weighted  by  Rp  and  Lp). 

Examples  include  Mp=( I+G'K)"' ,  Mp=G'K(I  +  G'K)*\  etc.,  as  discussed  in  Section  4.  Note  that 
Mp  depends  on  the  perturbation  Ar,  indicating  that  this  performance  should  be  met  for  all 
uncertainties. 

Let  M  and  M  denote  the  transfer  function  matrices  between  performance  outputs  anu 
pr  rp 

perturbation  outputs  and  between  perturbation  inputs  and  performance  inputs,  respectively.  In  terms  of 
these  matrices,  it  can  be  shown  that 

M  (a  )  =  M  +  M  a  (I  +  M  a  l’1  M  where  M  is  the  performance  matrix  in  tne 
p  r  p  pr  r  r  r  rp  p 

absence  of  uncertainties.  Define 


Ky  s  (1,  m.p  *  *  *  t  kp,  kp 

Lt  =  diag(Lp,  Lr) 

Rt  =  diag(Rp,  Rr) 

•xT  =k(kt) 

We  have  noted  the  dependence  on  K  here  to  avoid  confusion.  Of  course,  the  nominal  interconnection 
operator  M  is  assumed  to  be  stable. 

For  these  definitions,  the  following  relationship  exists  between  performance,  stability  robustness, 
and  the  SSV. 


pr 


rp 


Theorem  5-2:  (Robust  Performance) 

Hp'(ar)  stable  and  o(RpMp  ( a p) L"1 )  <  1  V- »  and  v- Ar«Xr  iff  wtRyhjL-j--1 )  <1  V-  <■> 

Proof :  It  follows  from  the  Small  u  Theorem  that  <1  4oJ 

iff  I  +  R^Li^e  >0,  4  Res  >  0,  4  e  such  that  Lj1  eRT  e  *X 

iff  4  Res  >  0,  jl  +  R^L"1  er|  >  0,  -¥-e  such  that  [J,  erRr  c 

and  |l  +  RpMp  (L/’e^Jlp-1  ep|  >  0,  -V  epsuch  that  L’1  epRp  e  7p 

iff  4  Res  >  0,  ll  +  M  i  |>  0  and  |l  +  R  M'  (All*1  el>  0  -V-  a  c  *  ,  4  en  sucn  that  [’  9  K  e  *X 

—  I  rr|  1  pp'r'  ppl  r  r*  p  pppp 

iff  Mp  (ar)  stable  and  <T(RpM^  (a[.)Lp"1)  <1  f  «  and  f  4f  c  Xr.  □ 

We  note  that  this  theorem  extends  the  Small  u  Theorem's  robust  stability  results  to  a  composite, 
simultaneous  result  on  robust  stability  and  performance.  Tnus,  given  an  uncertain  plant  model  witn 
structured  perturbations  and  a  performance  specification,  we  have  a  necessary  and  sufficient  condition 
in  terms  of  u  for  satisfaction  of  the  performance  spec  in  the  face  of  the  uncertainty.  If  the 
condition  u  <  1  is  met,  then  the  desired  performance  is  acnieved  for  all  perturbed  plants.  If  w  > 

1,  then  there  exists  a  structured  perturbation  whicn  causes  the  performance  spec  to  be  violated.  The 
robust  performance  condition  may  be  thougnt  of  as  arising  from  an  equivalent  "fictitious  uncertainty," 
although  this  interpretation  is  not  necessary. 

5.4  Hybrid  Compensators 

Consider  a  multi-rate  hybrid  compensator  for  a  continuous  plant  with  uncertainty.  Tne  compensator 
is  represented  via  a  sector  as  discussed  in  Section  3.4.  The  resulting  system  can  be  rearranged  as 
usual  to  obtain  the  representation  of  Figure  4.1  where 
a  =  diag  (Aj,a2) 

a,  =  diag(xnI,  x12I,  — ,  x^,  a„,  a]2,  a,^) 

a2  =  diag(A21,A22,  a^) 

The  Aj  contains  all  the  perturbations  to  the  continuous  plant  as  discussed  previously  in  this 
section.  The  a2  comes  from  the  sector  model  of  the  multi -rate  hybrid  compensator. 

Consider  the  set 

0  =  (diag{D.|,  02>  •”  Dn],  djl,  d2I,  ••  d  I,  Q,,  Q2,  ”,  Q^)) 

where  the  D^'s  and  dys  are  as  in  (5.6)  and  the  Qys  are  as  in  Section  3.4.  A  sufficient 
condition  for  stability  is  that 

sup  inf  0 (DM0  ^ )< 1  (5.1 

ui  Deff 

This  stability  test  combines  the  analysis  methods  of  Sections  3  and  4  to  provioe  a  reliaole  staDility 
test  for  uncertain  plants  with  hybrid  controllers.  Note  that  the  (Q.)  are  required  to  be  periodic 
with  the  same  period  as  the  associated  hybrid  compensator.  This  constraint  has  not  yet  been  exploited 
In  a  systematic  way.  A  somewhat  ad  hoc  approacn  would  be  to  ignore  the  perioaicity  and  compute  D  as  in 
Section  4.  The  periodicity  of  the  Q's  could  then  be  imposed  separately. 

Note  that  the  condition  in  equation  (5.10)  is  sufficient  for  stability  but  not  necessary.  Tne  use 
of  the  0  scaling  will  reduce  the  conservativeness  of  the  stability  test  but  not  eliminate  it  entirely. 

A  performance  result  analogous  that  in  Section  5.3  can  be  easily  obtained  for  the  use  of 
dtscrete-time  controllers.  As  in  Section  f  1  as  "fictitious  uncertainty"  block  can  be  introduced  to 
reflect  tne  performance  specification.  This  performance  block  must  be  treated  as  a  time-varying 


6-20 


perturbation  and  a  sufficient  performance  test  sucn  as  (5.10)  must  use  a  periodic  l)  for  the  performance 
block.  For  single-rate  problems,  the  controller's  period  can  be  used.  It  is  not  clear  how  to  best 
extend  the  performance  analysis  to  multi-rate  problems* 

6.0  INTEGRATED  FLIGHT  AND  PROPULSION  CONTROL  EXAMPLt 

6.1  Problem  Formulation 

The  use  of  SSV  analysis  is  illustrated  by  an  application  of  integrated  control  to  an  advanced 
fighter  aircraft  powered  by  a  mixed  flow,  low  bypass  ratio,  turbo-fan  engine.  Increased 
thrust-minus-drag  is  possible  in  large  portions  of  the  flignt  envelope  by  varying  the  engine  control 
schedules  and  limits.  In  the  subsonic  regime,  increased  thrust  is  available  during  flight  operation 
with  low  inlet  distortion.  For  example,  in  straignc  and  level  flignt  witn  low  sideslip,  increased 
thrust  could  be  obtained  by  shifting  the  fan  operating  line  upward  by  increasing  the  engine  pressure 
ratio  (EPR)  (refer  to  Figure  6.1).  Since  thrust  is  proportional  to  EPK  (at  constant  airflow;,  this 
"uptrim"  of  EPR  accomplishes  the  desired  increase  in  thrust. 

Thrust  increases  are  in  the  range  of  3-4t  at  a  0.9  Macn/3u,000  feet  operating  point,  unfortunately, 
moving  the  operating  point  decreases  the  fan  stall  margin.  Hence,  operation  in  an  uptrimmed  state 
requires  better  regulation  about  the  operating  point  to  prevent  inaavertent  compressor  stalls.  Hence 
the  control  objective  is  tighter  regulation  of  certain  engine  variables.  Conventionally  controlled 
engines  like  the  F-100  operate  with  16-18*  margins  as  shown  in  Table  6.1.  Here  “margin"  is  defined  as 

P  stall-P  nominal, 
p  nominal 

One  of  the  large  contributors  to  the  margin  requirements  is  maneuver  transients.  Existing  engine 
controls  treat  maneuvers  (aa,  ab)  as  unknown  disturbances  when,  in  fact,  this  information  is 
available  from  the  airframe/inlet  for  improved  control.  Tne  allocation  to  maneuvers  can  be 
significantly  reduced  by  integrated  control  of  the  flight  and  propulsion  systems. 

It  is  desired  that  margin  regulation  be  accomplished  without  adversely  affecting  thrust,  i.e.,  boun 
margin  and  thrust  are  to  be  regulated.  For  this  example,  performance  goals  are  specified  in  terms  of 
the  output  sensitivity  function.  Loop  properties  for  performance  are  specified  at  the  outputs  because 
the  example  is  focusing  on  output  regulation.  The  performance  goals  were  defined  as  a  low  frequency 
gain  of  about  40  about  1  rad/sec  bandwidth,  and  hign  frequency  sensitization  of  no  more  than  three. 

This  means  disturbances  will  be  reduced  by  a  factor  of  about  25  at  low  frequencies.  The  performance 
goals  are  described  by  a  performance  block  of  size 

i s+0.01 i 

3  TYtT.TT 

The  next  section  describes  the  airframe,  inlet  and  engine  model  used  to  design  the  “uptrim  regulator". 

6.2  Design  Models 

An  integrated  model  was  developed  by  interconnecting  linearized  airframe,  inlet,  and  engine  moaels. 
The  techniques  used  are  similar  to  those  used  in  the  Flight  and  Propulsion  Control  Coupling  studies 
[20]. 

A1 rf rame  -  A  linearized  longitudinal  and  lateral-directional  model  was  obtained  for  the  augmented 
airframe  at  a  0.9  Mach  30,000  foot  trim  condition.  An  eight-state  model  describes  linear  perturbations: 

Filter  state 
Pitch  state 
Pitch  angle 

State  *  Angle  of  Attack 
Velocity 
Altitu  ■> 

Angle  o.  ..rue-slip 
Side-slip  rate 


The  input  is 


Pi  ten  stick 
u  =  Rudder  pedal 

Net  thrust  (from  engine) 

In  addition  to  the  usual  airframe  measurement  of  load  factor  and  pitch  rate,  linearized  expression  were 
included  to  relate  changes  in  total  temperature  and  Mach  number  with  velocity  and  altitude  changes. 

aTt  =  fj(av,  ah) 

AM  =f2(Av,  Ah) 

These  quantities  will  be  inputs  to  the  inlet  model  below. 

Inlet  Model  -  The  inputs  to  the  inlet  model  are: 

a  -  Angle-of -attack 

8  -  Angle-of-Sideslip 

M  -  Mach  number 
T^  -  Total  temperature 

-  Engine  airflow 

The  outputs  are: 

PT2  -  Total  pressure  at  engine  face 

Kfl2  -  Distortion  index 

The  inlet  used  in  this  example  has  fixed  geometry  and  was  designed  for  subsonic  operation.  One  actuatoi 
positions  a  ramp  (pa)  based  on  an  angle-of -attack  schedule.  The  model  has  the  following  functional 
form  which  was  linearized. 

pa  =  f(»,  V,  h,  pa) 

Pj2  *  f(M,  TT,  Pft,  a,  8,  i»E) 

Ka2  *  f^pA’  M’  TT*  “•  8*  "E^ 

Engine  Model  -  The  engine  model  is  a  version  of  the  F-100,  a  low  bypass  ratio,  turbo-fan  engine.  The 
linear  model  was  supplied  by  Pratt  &  Whitney  and  was  developed  for  0.9  mach,  30,000  feet,  max  power 
operating  point.  The  engine  model  is  in  a  state-space  format 

x  =  Ax  +  Bu 
y  ■  Cx  +  Ou 

where  the  state,  control  and  outputs  are  defined  in  Table  4.2. 

Of  particular  interest  for  this  design  is  the  relation  of  fan  stall  margin  to  engine  inputs.  Tne 
fan  stall  margin  SMAF  is  defined  as 

5MAF  =  1  -  PRFAf|/S«VAN 

where  SPRpAN  is  the  fan  surge  pressure  ratio.  The  fan  surge  pressure  ratio  is  determined  from 
compressor  maps  as  a  function  of  corrected  air  flow  and  distortion  indes.  Tnus, 

SPRFAN4  f(Wc,  Ka2). 

Standard  linearization  techniques  are  used  to  derive  the  SMAF  response. 

Model  Uncertainty  -  The  integrated  model  is  linear,  time  invariant,  and  finite  dimensional,  and  thus  car 
be  represented  by  transfer  function  matrices  with  rational  elements.  Model  uncertainty  is  modeled  as 
sector-bounded,  linear,  time-invariant  operators. 

For  this  example,  four  uncertainties  are  added  to  the  nominal  loop  as  shown  in  Figure  6-2.  Tnree 
are  associated  with  the  uncertain  plant,  and  one  is  for  the  digital  compensator.  aa  represents 
actuator  uncertainty,  at  is  the  uncertainty  in  deriving  thrust  and  am  the  uncertainty  in 
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deriving  fan  stall  margin.  Ap  is  the  cone  radius  for  the  digital  compensator.  The  A-  are 
specified  by  *  I  and 

RA(j“)  ‘  TDDD" 
o  n  \  .  s+10 

„  ,  .  ,  s+1 

RT(ju)  '  100 

R0(j")  =  0.05 

and  their  frequency  characteristics  are  plotted  in  Figure  6-3. 


The  performance  goals  are  given  in  terms  of  the  return  difference  matrix  at  the  outputs  satisfying 
the  following  inequality: 

_  rric/i.w/t.n  3' ju+0. Oil  w  . 


Equivalently, 


£  [I+G(j-)K(»]  1  *“ 

o'  C(l*6(»K(j-))1  <  *. 

[(wojko))'1  i1 


This  constraint  on  output  sensitivity  is  equivalent  to  staoility  roDuscness  for  an  uncertainty  Ap 
from  row  4  of  Table  4.1  where 

The  “fictitious  uncertainty"  Ap  is  modeled  as  an  element  of  ( 0, Lp , Rp )  witn 

The  performance  specification  is  included  with  the  three  component  uncertainties  and  handled  witn  the 
structured  singular  value  analysis  in  the  next  section. 

6.3  Design  Application 

The  design  for  the  uptrim  regulator  is  intended  to  demonstrate  the  systematic  design  metnodology 
summarized  below.  The  design  of  this  multivariable  compensator  uses  advanced  LQG  techniques  for  shaping 
loop  transfer  matrices  in  the  frequency -domain  [14].  These  designs  are  presented  as  illustrative 
examples  and  are  not  intended  as  final,  flight-qual ity  control  laws. 

The  problem  formulation  emphasized  regulation  of  fan  margin  during  maneuvers.  Tms  may  result  in 
undesirable  responses  in  other  variables  such  as  thrust.  As  noted  in  the  modeling  section,  the  F-100 
eng'.-ic  has  available  multiple  inputs  for  regulating  engine  variables.  For  the  bare  engine  design  two 
outputs,  fan  margin  and  thrust,  were  identified  as  two  important  variables  to  be  controlled.  Two 
appropriate  engine  inputs,  nozzle  area  and  augmentor  fuel  flow,  were  selected  to  accomplisn  the 
regulation.  Other  combinations  of  inputs  and/or  outputs  could  be  selected.  Preliminary  designs  using 
the  six  inputs  to  the  engine  model  indicated  that  nozzle  area  and  augmentor  fuel  flow  were  effective  in 
regulating  the  selected  variables.  In  general,  one  would  use  the  inlet  contols  and  engine  controls  to 
hold  margins  during  maneuvers.  For  our  design  example,  however,  the  inlet  has  minimal  effect 
subsonically  so  our  example  will  use  just  engine  controls. 

Variables  not  regulated  would  have  to  be  checked  to  see  they  don't  exceed  normal  operating  limits, 
for  example,  temperature  constraints.  This  evaluation  was  not  part  of  our  example  design. 

Nevertheless,  this  two-input,  two-output  regulator  design  provides  a  realistic  exercise  of  our  LQG-basec 
methodology  and  SSV  analysis  techniques. 


The  design  methodology  is  summarized  in  the  following  steps: 
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Step  1  -  Design  a  KBF  such  that  the  loop  transfer  function  meets  performance  and  stability  robustness 
requirements. 

Step  2  -  Design  a  sequence  of  LQR's  with  the  scalar  parameter  q  allowed  to  take  on  consecutively  larger 
values. 

Step  3  -  Select  an  element  of  the  sequence  of  transfer  functions  whicn  adequately  approximates  the 
desired  transfer  function  over  the  frequencies  of  interest. 


The  system  description  is  a  state  space  model  consisting  of  A,  B,  and  C  matrices  as  developed  in  the 
modeling  section.  For  the  KBF  design  we  follow  the  advanced  loop  shaping  procedure  and  augment  the 
plant  with  the  desired  loop  shapes.  This  yields 


A  =  0  -.01 


0  0  -.01 


B  =  0 


Letting  N  =  I,  and  solving  for  the  KBF  gains  results  in  the  loop  properties  C(sl-A)"  KF  plotted  in 
Figure  6-4.  This  is  nothing  more  than  the  desired  response  and  completes  Step  1. 


To  achieve  loop  recovery,  a  sequence  of  LQR's  nill  ncv  be  designed.  The  state  weighting  matrix  is 

2~Tr 

q  C  C  q  - 

and  the  control  weighting  is  fixed  at  R=I.  Several  steps  of  the  sequence  for  q  =  0.003,  0.01,  0.03,  ant 
0.1  were  used  to  achieve  asymptotic  recovery.  The  final  iteration  is  judged  to  be  satisfactory  for  this 
design  (refer  to  Figure  6-5). 

The  corresponding  LQR  gains  are  combined  with  the  previous  KBF  to  form  the  overall  compensator.  Use 
of  the  asymptotic  procedure  results  in  fast  modes  in  the  filter  which  may  be  undesirable  from  an 
implementation  viewpoint.  They  are  easily  eliminated  by  standard  residualization  tecnniques.  In  this 
case,  there  were  two  modes  at  310  and  1190  ra d/s  which  are  clearly  outside  out  frequencies  of  interest. 
Elimination  of  these  two  modes  results  in  a  reduced  order  compensator  whose  loop  properties  are 
indistinguishable  from  the  full  state  design  shown  in  Figure  6-5. 


6.4  Analysis  of  the  Design 

The  singular  values  of  the  return  difference  matrix  <md  the  closed-loop  response  matrix  for  the 
design  with  the  reduced-order  compensator  are  shown  in  Figure  6-6.  These  plots  are  indistinguishable 
from  plots  where  the  loop  is  broken  at  the  input.  At  low  frequencies  (below  the  1  rad/sec  crossover), 
the  return  difference  matrix  is  dominated  by  the  loop  transfer  matrix  (Figure  6-5).  At  nigh  frequencies 
(above  the  1  rad/sec.  crossover),  the  closed-loop  response  matrix  is  dominated  by  the  return  difference 
matrix.  Both  of  these  plots  demonstrate  a  well-behaved  crossover.  The  example  design  provides 
disturbance  rejection  by  a  factor  of  at  least  60  at  frequencies  less  than  0.01  rad/sec. 


A  discrete-time  version  of  this  compensator  was  implemented  with  a  sample  time  of  0.1  sec.  A  third 
order  Butterworth  prefilter  cutting  off  at  10  rad/sec  was  included. 

The  structured  singular  value  (SSV)  discussed  in  Section  4  is  plotted  in  Figure  b-8.  Its  value  is 
less  than  unity  indicating  the  design  simultaneously  satisfies  the  robustness  and  performance 
requirements  previously  presented. 


A  time-domain  simulation  complements  the  frequency  domain  syntnesis  used  to  this  point.  Time 
histories  were  computed  for  a  pitch  stick  command.  Plots  are  made  with  and  without  the  “uptrim 
regulator".  Variables  plotted  are: 
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•  Angle -of -at tack 

•  Velocity 

•  Altitude 

•  Net  thrust 

•  Fan  stall  margin 

•  Engine  controls  (nozzle  area,  augmentor  fuel  flow) 

•  Engine  states  (Nl,  N2,  burner  metal  temperature) 

In  both  cases  tne  airframe's  response  is  nearly  identical.  Wien  the  regulator,  tnrusc  variations  are 
smaller  and  hence  the  velocity  and  altitude  profiles  vary  slightly.  The  peak  excursion  in  margin  is 
reduced  about  a  factor  of  2  over  the  unregulated  case.  Figure  6-9  shows  the  modulation  of  AJ  ana  WFAd 
used  to  accomplish  regulation.  Figure  6-10  shows  the  response  of  the  three  engine  states.  The 
simulation  shown  was  for  a  1 / 4g  incremental  "pull-up  maneuver".  If  the  results  are  scalea  to  a  "5g“ 
pull-up  the  perturbation  of  the  variables  are  shown  in  Table  6.3.  Note  that  less  than  a  10%  modulation 
of  nozzle  area  and  augmentor  fuel  flow  is  used  to  regulate  margin. 

7.  CONCLUSIONS 

This  paper  has  addressed  two  primary  topics: 

(i)  the  use  of  conic  sectors  to  represent  hybrid  compensators  and 

(ii)  the  use  of  the  structured  singular  value  to  analyze  linear  feedDack  systems. 

Starting  from  a  result  of  Thompson  [1],  we  showed  how  general  (even  unstable)  hybrid  compensators  with 
arbitary,  multiple  sample  rates  could  be  reliably  approximated  by  linear,  time-invariant  operators.  Tne 
approximation  error  is  properly  accounted  for  in  a  Conic  sector.  The  structured  singular  valve  provides 
a  nonconservative  measure  of  performance  in  tne  face  of  structured  uncertainty.  Tne  Small  u  Ineorem 
gives  a  necessary  and  sufficient  condition  in  terms  of  v  for  staoility  of  a  linear  system  witn 
multiple,  simultaneous,  norm-bounded  perturbations  of  arbitrary,  fixed  structure.  The  Robust 
Performance  Theorem  provides  a  similar  condition  for  the  satisfaction  of  performance  specifications  in 
the  presence  of  structured  perturbations. 

The  use  of  SSV  analysis  with  hybrid  compensators  was  illustrated  througn  the  design  example  of  an 
integrated  airframe/inlet/engine  control  mode.  The  control  objective  of  regulating  thrust  and  fan  stall 
margin  during  maneuvers  was  expressed  as  magnitude  constraints  on  the  output  sensitivity  function  in  the 
frequency  domain.  Model  uncertainty  was  assumed  at  the  actuators,  in  the  estimate  of  thrust  and  stall 
margin,  and  in  the  representatior  of  the  hybrid  compensator.  An  LQG-based  procedure  was  used  to  achieve 
loop  properties  which  satisfied  performance  and  robustness  properties  one-at-a-time.  The  SSV  was  used 
to  assess  simultaneously  the  effects  of  uncertainty  on  the  performance  and  robustness  of  the  system. 

Structured  singular  value  analysis  is  a  powerful  technique  for  quantifying  the  impact  of  uncertainty 
occurring  throughout  a  feedback  system  on  overall  closed-loop  performance.  At  the  present  time, 
synthesis  techniques  for  designing  directly  in  terms  of  the  SSV  are  lacking.  A  number  of  researeners 
are  addressing  this  topic  and  progress  is  expected. 
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Figure  2-2.  Illustration  of  a  Nonlinearity  Contained 
In  a  Conic  Sector. 


Figure  2-3.  Nyquist  Plane  Interpretation  of 
of  a  Conic  Sector  Condition 


Figure  2-4.  Feedback  Configuration  for  the 
Small  Gain  Theorem. 
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Figure  2-5.  Three  Equivalent  Representations  of 
the  Feedback  Configuration  Involving 
Operator  G  and  Sector  (C,  L,  R). 
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Figure  6-9.  Response  of  Thrust  and  Margin  With  and 
Without  Regulation 


Figure  6-10.  Response  of  Engine  States  With  and 
Without  Regulation. 
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7.1  SUMMARY 

The  design  of  the  dynamics  of  a  digital  control  for  satisfactory  transient  response  can  be  done  in  a 
number  of  ways.  One  of  the  more  effective  ways  is  to  do  the  design  so  that  the  poles  of  the  closed  loop 
system  are  in  desired  or  at  least  acceptable  locations.  Such  design  schemes  are  known  as  pole  placement 
methods.  In  this  section  the  method  of  pole  placement  will  be  described  and  formulas  suitable  for  computer 
implementation  will  be  given.  Also,  the  method  will  be  compared  to  both  the  transform  methods  described 
earlier  and  to  the  methods  based  on  optimal  control,  including  stochastic  control  and  the  Kalman  filter. 
Several  examples  will  be  given  to  illustrate  the  methods. 

7.2  SPECIFICATIONS  IN  TERMS  OF  POLE  LOCATIONS 

The  first  requirement  of  pole  placement  design  is  the  formulation  of  the  specification  in  terms  of 
pole  locations.  Three  approaches  to  this  part  of  the  problem  will  be  described.  These  are  the  dominant 
second  order,  the  higher  order  prototypes,  and  the  symmetrical  root  locus. 

(a)  Dominant  Second  Order.  The  second  order  transient  with  complex  roots  at  radius  u>n  and  damping 
ratio  C  is  well  documented  and  has  a  response  sketched  in  Fig.  7.1.  From  this  plot,  characteristics  such 
as  overshoot,  rise  time,  and  settling  time  can  be  identified.  For  example,  the  overshoot  is  mainly  dependent 
on  the  damping  ratio  and  the  relation  is  plotted  in  Fig.  7.2.  Also,  we  can  associate  rise  time  with  the 


Damping  ratio  r. 


Figure  7.1:  Response  of  second  order  Figure  7.2:  Relation  between  damping 

system  ratio  and  per  cent  overshoot 

natural  frequency  n  bandwidth  and  the  settling  time  with  Cwn,  the  real  part  of  the  corresponding  poles. 
Thus,  for  the  second  order  system,  we  can  express  the  relation  between  time  domain  specification  and  pole 
location  as 
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Solving  for  the  pole  parameter,  we  find 


G  1 

0.6^1 

0 j  > 

2.5/t 

n  — 

> 

n  — 

4.6/t 

damping  ratio 

natural  frequency  (radius) 

real  part 


(7.1) 


The  relations  7.1  tell  us  how  specifications  on  a  second  order  response  would  be  expressed  in  terms  of 
s-plane  pole  locations.  To  relate  these  to  digital  design,  we  use  the  relation 
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Figure  3:  Maps  of  pole  locations  to  meet  transient  specifications 

(a)  Rise  time;  (b)  Overshoot;  (c)  Settling  time 

With  Eq.  (7.2),  the  relations  of  Eq.  (7.1)  map  into  the  z-plane  as  sketched  in  Fig.  7.3. 

To  apply  the  technique  of  dominate  second  order  poles,  it  is  necessary  to  be  sure  that  the  response 
is  not  significantly  modified  by  other  poles  (or  zeros)  in  the  system.  While  bounds  are  hard  to  obtain, 
experiment  reveals  that  if  the  dominate  poles  are  in  the  right  half  of  the  z-plane,  then  an  extra  real  pole 
or  zero  in  the  left  half  of  the  z-plane  will  have  negligible  effect  and  will  have  small  effect  so  long  as 
the  added  pole  or  zero  has  a  smaller  real  part  than  the  dominate  poles  do. 

(b)  Prototype  Design.  In  some  cases  a  higher  order  than  second  can  be  used  effectively  with  all 
poles  interacting  in  a  specific  way.  Two  interesting  higher  order  systems  are  the  Butterworth  and  the 
ITAE  propotypes.  Butterworth  filters  are  designed  to  display  a  frequency  response  which  is  maximally 
flat.  The  ITAE  prototypes  were  developed  for  servomechanisms  so  the  step  response  would  minimize  the  error 
integral 


I  =  1 1  e|  dt 


which  is  the  _Inte8ral  of  Time  times  Absolute  Error.  The  transient  response  of  these  systems  are  shown  in 
Fig.  7.4  and  the  normalized  coefficients  for  the  ITAE  are  given  in  Fig.  7.5  The  poles  of  the  Butterworth 
are  equally  spaced  on  the  unit  circle. 


Normalized  time 


5  10  15 

Normalized  time 


Figure  7.4:  Transient  responses  of  (a)  Butterworth  and  (b)  ITAE  prototypes 


1  s  +  1 

2  s  +  0.707  i  JO. 707 

3  (s  +  0 . 7 08 1 ) ( s  +  0.521  ±  jl.068) 

4  (s  +  0.424  +  J1 . 263 ) (s  +  0.626  ±  JO. 4141) 

5  (s  +  0.8955*(s  +  0.376  ±  jl.292)(s  +  0.5758  ±  JO. 5339) 

6  (s  +  0.3099  ±  J 1 . 263 ) ( s  +  0.5805  ±  J0.7828)(s  +  0.7346  ±  JO. 2873) 

Figure  7.5:  Table  of  pole  locations  in  the  s-plane 
for  the- ITAE  prototype 

(c)  The  Symmetric  Root  Locus.  One  of  the  most  effective  and  widely  used  methods  of  linear  control 
system  design  is  the  linear  equation,  quadratic  loss,  Gaussian  noise  or  LQC  optimal  control.  The  method 
separates  into  two  problems:  the  linear  quadratic  regulator  (no  noise,  complete  state  feedback)  and  the 
least  squares  estimator  (Kalman  filter)  to  produce  estimates  of  the  state.  The  1QR  problem  is  to  find  the 
control  so  that  the  performance  index 


is  minimized  for  the  system 


*k  ■  Hxk 

We  can  show  that  the  optimal  control  is  given  by  the  linear  state  feedback  u  -  -Kx.  If  ve  define  the  open 
loop  transfer  function  as 


H0(z)  -  H(zl-4)_1r 


(7.5) 


then  we  can  show  that  (with  some  uninteresting  exceptions)  the  poles  of  the  optimal  closed  loop  system  are 
given  by  the  stable  roots  of  the  symmetric  root  locus 

det(l  +  p  Xq(z_1)  *0(z>)  -  0  (7.6) 

In  the  case  of  single  input  systems,  Eq.  (7.6)  is  a  scalar  root  locus  problem  with  respect  to  the 
parameter  p  which  weights  the  tracking  error,  y^,  versus  the  control  effort,  u^  in  the  loss  function 
of  Eq.  (7.3).  The  design  may  thus  be  accomplished  by  first  selecting  the  matrix  H,  which  defines  the 
"tracking  error,"  and  then  selecting  the  p  which  balances  the  Importance  of  tracking  error  against  control 
effort.  As  we  will  shortly  see,  once  the  poles  have  been  chosen,  in  the  single  input  case,  the  control  law 
K  can  be  computed  by  explicit  formula.  An  example  of  the  use  of  this  method  may  be  given  by  selecting  the 
double  integrator  plant  as  follows.  The  discrete  state  equations  for  sampling  period  1  second  in  terms  of 
position  x^  and  velocity  x 2  are 


x1(k+l) 


x2(k+l) 


K1  +  x2^  +  2 


x2(k)  +  u(k) 


(7.7) 


We  can  select  the  tracking  error  to  be  a  linear  combination  of  position  and  velocity  error  by  choosing 


y  “  Xj^  +  <xx  2 

From  Eqs.  (7.7)  and  (7.8)  we  compute  the  open  loop  transfer  function  to  be 

K0(z)  -  H(zI-$)-1 

a  -  1/2 

z  "  '  1/2 


(7.8) 


(7.9) 


'  (z  -  iv 


From  Eq.  (7.9)  we  see  that  the  open  loop  transfer  function  can  be  given  a  zero  anywhere  from  z  »  -1 
corresponding  to  a  ■  0  to  z  *  +1  corresponding  to  a  «  The  symmetric  root  locus  of  Eq.  (7.6)  for 
this  case  is 


where  8  »  (a  -  l/2)/(a  +  1/2)  and 
The  root  locus  for  8  ■  -1  (a  ■ 


1 

1 

p' 

0) 


+  p  Xq(z_1)  K0(z)  -  0 

,  z  -  8  z(z-l/B)  „ 
(z-l)2  (z-1)2 

«  p (a  +  1/2)2. 

is  sketched  in  Fig.  7.6a, 


0 


and  for 


8 


0.5  (a 


(7.10) 


1.5)  in  Fig.  7.6(b). 
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Figure  7.6a:  Symnetrlc  Root  Locus  for 
1/s^  Plant  and  a  ■  0 


igure  7.6b:  Symmetric  Root  Locus  for 
1/s^  Plant  and  a  «  1.5 
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As  can  be  seen  froa  these  sketches,  the  location  of  the  optimal  closed  loop  poles  can  be  substantially 
varied  by  choice  of  a  and  p.  However,  in  every  case  the  resulting  locations  are  optimal  for  some 
quadratic  performance  criterion  of  the  form  of  Eq.  (7.3). 

7.3  POLE  ASSIGNMENT  BY  STATE  FEEDBACK 


Once  the  dynamic  performance  has  been  expressed  in  terms  of  pole  locations,  it  is  necessary  to  consider 
the  structure  of  the  controller  which  will  cause  the  closed  loop  system  to  have  these  poles.  There  are 
several  techniques  to  solve  this  problem,  among  the  most  effective  being  the  separation  of  the  design  into 
two  parts:  a  state  feedback  control  law  and  a  state  estimator  to  replace  the  unmeasured  states  with  their 
estimates.  We  begin  with  the  state  feedback  to  assign  closed  loop  poles  to  specified  locations. 


The  algebra  of  the  case  is  given  by  the  state  equations 


x(k+l) 


♦  x(k)  +  T  u(k) 


(7.11) 


and  the  control  law  is  given  by 


where  K  is  a  row  matrix. 


The  specifications  are  given  by  the  fact  that  the  closed  loop  poles  should  be  at  desired  locations. 
One  way  of  expressing  this  requirement  is  that  the  closed  loop  characteristic  polynomial  is  specified  as 
a^fz).  Combining  chese  relations,  we  arrive  at  the  requirement  that 


det(zl  -  4  +  TK) 


(7.13) 


by  choice  of  K.  Equation  (7.13)  is  an  algebraic  equation  in  the  components  of  the  control  law  matrix  K 
and  one  way  to  solve  the  problem  is  to  work  out  the  form  of  the  determinant  on  the  left  hand  side  and 
equate  the  coefficients  of  the  several  powers  of  z  term  by  term.  One  obtains  n  equations  this  way,  and 
if  u  is  a  scalar,  K  has  n  components  so  we  have  n  equations  in  n  unknowns  and  the  promise  of  a 
unique  solution.  A  more  interesting  approach  leads  to  an  explicit  formula  easily  Implemented  on  a  computer, 
first  worked  out  by  Dr.  J.  Ackermann  (1972).  The  derivation  given  here  was  suggested  by  K.J.  Xstrom  in  a 
private  conversation  with  the  author. 

The  derivation  begins  with  the  observation  that  if  the  closed  loop  system  matrix  is  4>c  »  4  -  TK  and 
the  closed  loop  characteristic  polynomial  is  a  (z)  then  a  well  known  result  of  matrix  algebra  is  the 
Cayley-Hamilton  theorem  which  says  that  *c  satisfies  (*c  ■  0,  namely 


.  n  «n*— 1 

*  +  a.4  + 

c  1  c 


'  ■  +  Vl*c+  “n1 


(7.14) 


The  closed  loop  system  equations  are 


x(k+l)  -  $c  x(k) 

which  has  the  easily  obtained  solution  for  any  initial  condition 


Now  we  can  show  that 


x(k+n)  +  a.x(k+n-l)  +  ...  +  a  ,x(k+l)  +  a„x(k)  «  0  (7.16) 

x  n-i  l 

Because  we  obtain,  with  Eq.  (7.15)  substituted  into  Eq.  (7.16), 

+  ...  +  a  l)<t>k  x,,  -  0  (7.17) 

V  c  1  c  n  /  c  — 0 

Now  we  return  to  the  question  of  the  control  and  what  control  law  K  will  cause  Eq.  (7.16)  to  be  true  for 
given  4  and  T.  Proceeding  step  by  step  with  x(k+l  “  4>  x(k)  +  T  u(k)  we  find  that  Eq.  (7.16) 
requires  that 

($n  +  a^"-1  +  ...  +  anl)x(k)  +  r  u(k+n-l)  +  (<J>r  +  a1Du(k+m-2)  +  ... 

+  ($n_1r  +  a1*n"1r  +  ...  +  an_1r)u(k)  -  o  (7.18) 


Collecting  terms,  this  expression  can  be  written  as 

ac($)x(k)  +  [r  $r  ...  $n-1r]  fi 


a  ,  u(k+n-l ) 
n-1 


1 


u  (  k ) 
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We  define  Che  ran  matrix  (u  is  a  scalar  here) 


e  -  r  4>r  ...  $n_1r 


(7.20) 


and 


1  a1  a2 

Ola, 


Then  Eq.  (7.19)  can  be  written  as 


u ( k+n-1 ) 


u(k) 


-e-1  a  ($)x(k) 
c 


T  A 


The  entry  in  the  last  rov  on  the  left  is  seen  to  be  u(k)  alone  so,  finally,  if  e  ■  [0  0  .. 

nfJl  unit  vector  of  order  n. 


1],  the 


u(k)  =  -e^  C  1ac($)x(k) 


so  that  the  control  law  is 


K  «  eT  <?  ^a  ($>) 
n  c 


(7.21) 


Equation  (7.20)  is  Ackermann's  formula. 


The  form  of  Eq.  (7.21)  immediately  raises  the  question  of  the  existence  of  a  solution  for  K. 

Solution  according  to  this  formula  requires  Chat  the  matrix  C  given  by  Eq.  (7.20)  must  be  non-singular. 

This  matrix  is  called  the  Controllability  matrix  and  if  it  is  singular  then  some  dynamic  mode  of  the  system 
is  not  connected  to  the  input  and  cannot  be  controlled.  The  structure  of  the  system  must  be  changed  if  this 
mode  is  unstable  or  is  so  slow  that  it  can  prevent  the  satisfactory  control  of  noise  and  other  disturbances. 
Computing  the  controllability  of  a  dynamic  model  is  an  important  requirement  of  computer  aided  control  design 


To  illustrate  the  design  of  state  feedback  two  examples  will  be  given.  In  the  first  instance,  we 
consider  the  1/s^  plant  for  which 


"l  t" 

T2/2 

$  = 

;  r  - 

0  1 

T 

and  the  closed  loop  characteristic  equation  is 

det(zl  -  $  +  TK)  ■  0 


z2  +  TK2  +  (TZ/2)K1  -  2  z  +  (T2/2)K1  -  TK2  +  1  -  0  (7.22) 


Suppose  the  sampling  period  is  0.1  sec  and  the  dynamic  response  specifications  can  be  expressed  by  the 
desired  closed  loop  characteristic  polynomial, 


ac(z)  -  z2 

which  has  poles  at  radius  0.836  and  angle  17.0°. 
Equating  coefficients  in  Eq.  (7.22)  with  Eq. 


-  1.6z  +0.7  -  0 

(In  the  s-plane,  these  correspond  to  con  ■  3.6, 
(7.23),  we  can  solve  for  the  control  law  as 
=  10 
•  3.5 


a. 23) 

C  -  0.5). 


(7.24) 


These  gains  can  also  be  computed  by  Ackermann's  Formula,  as  shown  by  the  computer  printout  in  Fig.  7.7 


INPUT  2  POLES  IN  THE  FORM 
RADIUS .ANGLE ( DEGREES ) 

POLE  1  - 
1 

.836,17 

CONTROL  GAIN 
9.995444803 
3.5081224015 
D  C  GAIN  .100045572729 


2 

Figure  7.7:  Computer  Print-out  of  Program  CONLAW  for  1/s  Example 

The  response  to  an  initial  condition  of  x.(0)  «  1.0,  also  provided  by  the  program  CONLAW,  is  shown  in 
Fig.  7.8.  CONLAW  is  one  of  the  programs  in  the  set  called  DIGICON-85  for  which  a  brief  description  is 
given  in  the  appendix  to  this  presentation. 


7 

Figure  7.8:  Response  of  l/s  Plant  to  Initial  State  as 
Produced  by  CONLAW 

The  second  example  illustrating  the  design  by  state  feedback  models  a  servomechanism  with  the  sensor 
on  a  mass  coupled  to  the  motor  by  a  flexible  structure.  The  transfer  function  with  time  measured  in  units 
of  10  milliseconds  is 


V  "  s ( s  +  0. 513) (s  +  0.24  +  J3.12)  (7.25) 

With  a  sampling  time  of  T  -  0.6  which  is  6  milliseconds  in  real  time,  the  discrete  system  $  and  V 
are  computed  by  the  program  SAMFLE0  which  also  calculates  that  the  system  has  zeros  at  -6.99,  -0.884  and 
-0.113  and  has  poles  at  1.0,  -.257  ±j. 827,  and  0.735. 

For  this  case,  the  specifications  are  satisfied  by  the  fourth-order  ITAE  propotype  with  poles  at  radius 
0.654,  angle  72°  and  radius  0.535,  angle  24°.  The  output  of  CONLAW  is  shown  in  Fig.  7.9  and  the  corresponding 
step  response  in  Fig.  7.10. 


INPUT  4  POLES  IN  THE  FORM 
RAD IUS,ANGLE( DEGREES) 

POLE  1  - 
? 

.654,72 
POLE  3  - 
.535,24 
CONTROL  GAIN 
4.41149595241 
5.48399355082 
-3.36940938182 
.334075120487 
TRY  NEW  POLE  POSITIONS 
? 

N 


Figure  7.9:  Print-out  of  CONLAW  for  Fourth-order  Servomechanism 
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Figure  7.10:  Step  Response  of  Fourth-order  Servomechanism 
showing  ITAE  Response,  "y"  Marks  the  Output; 
"u"  Marks  the  Control 


7. It  STATE  ESTIMATION 

The  control  given  by  state  feedback  assumes  that  all  the  states  are  available.  In  practice,  of  course, 
only  a  very  few  states  are  measured  and  a  state  feedback  control  law  alone  is  almost  never  useable.  However, 
we  can  replace  the  actual  states  by  estimates  and  recover  the  response  to  command  inputs  as  if  true  state 
feedback  is  used.  The  calculations  go  like  this. 

Given  a  system  with  equations 

y(k)  =■  H  x(k) 

(7.26) 

x(k+l)  -  $  x ( k)  +  r  u(k) 

we  want  to  obtain  an  estimate  x(k)  such  that  the  error  x  «  x  -  x  is  quickly  small.  One  method  is  to 
use  a  model  of  the  system,  which  would  have  output  y  »  Hx  and  to  use  the  error  in  the  output  to  correct 
the  model.  The  resulting  equations  are 

x(k+l)  -  t  i(k)  +  Tu(k)  +  L(y(k)  -  Hi(k))  (7.27) 

The  error  in  this  estimate  is  easily  calculated  to  be 


x(k+l)  ”  ($  -  LH)x 


(7.28) 


Two  items  are  important  to  notice  with  respect  to  this  error  equation.  In  the  first  place,  we  notice  that 
the  control,  which  appears  in  Eq.  (7.26)  and  (7.27)  has  cancelled  out  of  the  error  equation,  (7.28).  Thus 
the  estimation  error  is  independent  of  the  scheme  used  to  compute  the  control.  In  the  second  place,  the 
correction  gain  matrix,  L,  appears  in  the  error  equation  in  a  manner  that  promises  to  give  the  designer  T 
some  influence  over  the  poles  and  hence  the  dynamic  response  of  the  estimator.  In  fact,  since  det  A  »  det  A  , 
the  characteristic  equation  of  the  error  system  (7.28)  is 


det(zl  -  9  +  LH)  =  0 

det(zl  -  $>T  +  HTLT)  -  0 


(7.29) 


and  Eq.  (7.29)  is  Identical 
we  have  HT  and  for  K  we 
solution  for  L  exists  for 
property  is  actually  called 
from  the  sensor  output  y. 
of  Ackermann's  Formula. 


T 

to  the  control  equation  (7.13)  except  in  place  of  <J>  we  have  4>  and  for  r 
solve  for  L^.  Furthermore,  because  of  this  identity,  we  can  be  sure  that  a 
an  arbitrary  i>  and  H,  provided  and  are  "controllable."  The 

"observable"  and,  if  it  fails  the  cause  is  a  dynamic  mode  which  cannot  be  seen 
Also,  because  of  this  identity,  we  can  solve  for  L  by  an  estimator  version 


The  estimator  described  by  Eq.  (7.27)  computes  the  estimate  at  time  k+1  from  the  sensor  output  at 
time  k  and  is  called  a  prediction  estimator.  Two  natural  variations  on  the  estimator  equations  give  the 
current  estimator  and  the  reduced  order  estimator.  The  current  estimator  equations  are 


x(k)  «  x(k)  +  b(y(k)  -  H  x(k))  Measurement 

up-date  (7.30) 

5T(k+l)  ■  $  x(k)  +  T  u(k)  Time  up-date 

These  equations  are  structurally  the  same  as  the  Kalman  filter.  The  Kalman  filter  involves  an  optimal 
choice  of  L  for  a  specific  set  of  assumptions.  The  reduced  order  estimator  is  a  prediction  estimator  for 
the  reduced  state  which  remains  when  we  use  the  output  y(k)  as  a  known  state  which  does  not  need  to  be 
estimated.  For  details,  see  Franklin  and  Powell  (1980). 

7.5  COMBINED  CONTROL  AND  ESTIMATION 

If  we  now  use  the  states  estimated  by  Eq.  (7.27)  in  place  of  the  actual  states  in  the  control  law, 
we  obtain  the  combined  system 
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x(k+l) 

=  x(k)  + 

x(k+l) 

-  $  x(k)  + 

u(k) 

-  -I&(k) 

y(k> 

=  Hx(k) 

If  we  replace  x  by  x  -  x  and  substitute  for  u  and 

x(k+l)  »  (<t-  K)x(k) 

x(k+l)  »  ($  -LH)x(k) 


u(k) 

u(k)  +  L(y-Hx) 

(7.31) 

y  in  (7.31),  we  obtain  the  combined  equations 
-  TKx(k) 

(7.32) 


Since  the  second  equation  in  (7.32)  depends  only  on  x,  the  dynamics  of  this  equation  are  those  of  $  -  LH. 
These  are  set  by  the  designer  by  choice  of  L  to  correspond  to  an  arbitrary  estimator  characteristic  poly¬ 
nomial  a  (z).  The  first  equation  in  (7.32)  has  x  as  a  forcing  function,  but  the  feedback  is  entirely  in 
the  matriS  4>  -  TK  which  imparts  dynamics  according  to  the  control  characteristic  polynomial  a  (z), 
selected  by  the  designer  by  choice  of  K.  The  overall  characteristic  polynomial  is  the  product  ca  (z)a  (z). 
Thus  we  see  that  the  control  and  estimation  separate.  c  e 


If  we  have  an  external  input  as  a  reference  command,  this  signal  can  be  added  to  the  system  via  the 
estimator  in  the  form  of  input  matrices  M  and  N  as  follows: 


x(k+l)  »  (4>-rK)x  +  L(y-Hx)  +  Mr 

u  -  -Kx  +  Nr 


(7.33) 


By  choice  of  M  and  N  the  designer  can  select  zeros  so  as  to  cause  the  overall  transfer  function  to  be 


Y(z)  _  T(z)b(z) 

U(z)  a  (zia  (z!> 
e  c 


(7.34) 


where  b(z)  are  the  zeros  of  the  process  being  controlled,  O  (z)  is  selected  arbitrarily  by  choice  of  L, 
ac(z)  is  selected  arbitrarily  by  choice  of  K,  T(z)  is  selected  arbitrarily  by  choice  of  M,  and  the 
overall  d.c.  gain  is  selected  by  choice  of  N.  It  is  most  common  to  select  N=*l  and  M“1  for  which  choice 
T(z)  »  ae(z)  and  the  estimator  poles  are  cancelled  from  the  command  input-output  transfer  function. 
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As  an  example  of  this  design  method,  we  consider  again  the  1/s  plant  with  sampling  rate  T-0. 1 
whose  response  with  state  feedback  is  shown  in  Fig.  7.8.  The  estimator  used  is  a  reduced  order  estimator 
with  L  chosen  to  make  a  (z)  «*  z-0.5.  The  input  matrices  M  and  N  are  selected  to  make  T(z)  ■  a  (z) 
and  gain  such  that  r(<»)  -ey(°°)  »  0  for  constant  reference  input.  Figure  7.11  shows  the  printout  of  the 
program  REDEST,  which  computes  L  and  the  dynamics  equations  of  the  controller  as  given  by  equations  7.34. 

The  step  response  of  the  resulting  controller  is  shown  in  Fig.  7.12,  as  given  by  the  program  RESP. 

The  response  to  an  initial  state  x^(0)  «  1  is  shown  in  Fig.  7.13.  Figure  7.13  should  be  compared  to 
Fig.  7.8  where  it  will  be  seen  that  the  response  with  estimator  has  more  "undershoot"  (-0.5  compared  to 
-0.16)  and  a  longer  duration.  This  is  the  price  paid  for  having  to  estimate  the  second  state  compared  to 
measuring  it  directly. 


An  alternative  to  the  state  space  methods  given  above  is  to  postulate  a  dynamic  controller  with  two 
inputs  (y  and  y)  and  one  output  (u)  and  to  solve  for  the  transfer  functions  directly.  We  model  the 
plant  as  a  transfer  function  rather  than  by  state  equations 


Y(z)  _  b(z) 
U(z)  =  a(z) 


(7.35) 


and  model  the  controller  similarly  as 

R(z)U(z)  -  -S(z)Y(z)  +  T(z)Yr 


(7.36) 


We  use  Y  (z)  for  the  command  signal  to  avoid  confusion  with  the  polynomial  R(z)  in  Eq.  (7.36). 
To  complete  the  design,  we  request  that  the  closed  loop  transfer  function  by  given  by 


_Y_  =  T  (z  )b(z ) 

Y  a  (z)a  (z) 

r  e  c 

From  Eqs.  (7.35)  and  (7.3b)  we  have 

a(z )Y(z )  -  b(z)[f  V(z) 

[R(z)a(z)  +  b(z)S(z)  lY(z )  =  b( z )T ( z )Yf ( z ) 


(7.37) 


(7.38) 


Comparing  Eq.  (7.38)  with  Eq.  (7.37)  we  see  immediately  that  the  design  can  be  accomplished  if  we  can  solve 
the  Diophantine  Equation 

R(z)a(z)  +  b(z)S(z)  ■  a  (z)a  (z)  (7.39) 

e  c 

for  given  arbitrary  a,  b,  a^,  ac>  If  a(z)  is  of  degree  n  and  b  is  of  degree  n  or  less  and  aeac 


t 
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Figure  7.11:  Print-out  for  Program  REDEST  for  1/s2  Plant 
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Figure  7.12:  Step  Response  of  1/s  Plant  with  Controller 
Including  Estimator 


W1 


Figure  7.13:  Response  of  l/sz  Plant  to  Initial  State  x^(0)  =  1 

ie  of  degree  2n,  then  a  solution  exists  for  R  and  S  of  degree  n-1  each  if  and  only  if  a  and  b 
have  no  common  factors.  Again  using  the  l/s^  plant  for  illustration,  we  have 


from  which 


z'  -  2z  +  1 


b(  z )  =  Y  (z  +  l) 


a  (z)  »  z  -  1.6z  +  .7 

c 


Letting  the  constant  T  /2  be  absorbed  in  S(z)  for  simplicity,  Eq.  (7.39)  appears  as 


(rjjZ+r^ ) (z  -2z+l)  +  (sgZ+s^Xz+l) 


2.1z  +  1. 5z  -  .35 


Equating  equal  powers  of  z  in  this  expression  we  see  immediately  that  rQ“l  and  for  the  other  unknowns 
we  write 


From  these  equations  we  find 


-  -.2375 


Removing  the  normalizing  factor  of  T  /2  *  0.005  we  get 


These  are  the  values  given  as  the  controller  numerator  in  Fig.  7.11  computed  by  state  variable  methods. 
7.6  CONCLUSIONS 


The  design  of  digital  control  systems  for  arbitrary  pole  placement  can  be  (and  has  been)  implemented 
in  a  very  straightforward  way  on  a  computer.  However,  the  simple  algebraic  formulas  suitable  for  low  order 
non-critical  systems  require  considerable  care  when  the  intention  is  to  implement  a  control  design  package 
which  will  provide  the  designer  with  trouble-free,  reliable,  inexpensive  graphic  responses  to  the  require¬ 
ments  of  computing  controllers  and  finding  discrete  poles,  zeros,  controllability,  observability,  control 
laws,  estimator  laws,  root  loci,  frequency  responses  and  alternative  parameters  for  realization.  To  meet 
these  objectives  a  great  deal  of  work  needs  to  be  done. 


7.7  APPENDIX 


DIGICON/85: 

Computer  Aid  for  the  Design  of  Digital  Controls  Using  the  HP-85 


X.  Introduction 


The  set  of  programs  which  constitute  DIGICON/85  are  for  the  computation  of  digital  controls  by  root 
locus  and  pole  placement  methods.  The  programs  are  written  in  BASIC  for  use  on  the  HP-85  desk  top  computer 
with  a  MATRIX  and  memory  ROM  expansion  and  consist  of  a  set  of  individual  programs  to  be  loaded  from  the 
tape  cartridge. 

The  notation  of  the  problems  to  be  solved  may  be 
vehicle,  servomechanism,  or  whatever)  is  described  by 
state  as  follows: 

summarized  as  follows.  The  process  (chemical  plant, 
equations  in  the  output,  y,  the  input  u,  and  the 

^k 

-  Hxk  +  Juk 

Output  equation 

(la) 

*k+l 

=  Fxk  +  GO^  +  wk) 

State  up-date  equation 

(lb) 

the  transfer  function  of  the  process  is 

g(z) 

-  H(zl  -  F)  *G  +  J 

(2) 

„  b(z) 
a(z) 

(3) 

b  zn  +  b, z"”1  +  . , 
o  1 

,  .  +  b 

n 

(A) 

n  ,  n-1  . 

z  +  a^z  +  . . , 

.  +  a 

n 

n(z  -  z  ) 

K  n<z  -  ?i) 

The  process  is  to  be  controlled  by  a  dynamic  system  which  may  or  may  not  include  integral  action.  With 
no  integral  action,  the  controller  equations  are  in  terms  of  the  process  output  y,  the  process  input  u, 
the  controller  state  xl,  and  the  external  command  input  as  follows: 

"k 

"  Cxlk  +  Dyk  +  Nrk 

(5a) 

xlk+l 

"  Axlk  +  B’,k  +  Mrk 

(5b) 

If  the  controller  is  to  include 
follows : 

integral  action,  then 

the  equations  include  the  integrator  state,  x2,  as 

“k 

“  x2fe  +  Cxlfe  +  Dyk  +  Nx2fc 

(6a) 

xlk+l 

-  ^  +  B*k  +  «*2k 

(6b) 

x2k+l 

-  x2,  +  K  (r,  -  y,  ) 

k  o  k  k 

(6c) 

The  controller  has  two  transfer  functions  since  the  output  of  the  controller,  u  (which  is  the  input  to  the 
process  under  control)  depends  on  two  inputs:  a  feedback  input  from  y  and  a  feedforward  input  from  r 
with  no  integral  action,  and  from  x2  with  integral  action.  For  the  case  of  Eq.  (6)  with  no  integral  action 
we  write 

C1(z)Y(z)  +  C2(z)R(z) 

a\ 

d  (z) 


where  the  roots  of  d(z)  =  0  are  the  controller  poles,  the  roots 
the  roots  of  C2(z)  ”  0  are  the  feedforward  zeros.  In  a  totally 
integral  action  by  the  two  equations 


of  C^(z)  =  0  are  the 
similar  way  we  express 


feedback  zeros,  and 
the  control  with 


U(z) 


C1(z)Y(z)  +  C2(z)X2(z) 
d(z) 


(8) 
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j 

i 


where  as  before  d,  and  C,  correspond  to  poles,  feedback  zeros,  and  feedforward  zeros.  In  Eq.  (6c), 

the  parameter  Is  part  of  the  control  law  designed  for  a  system  with  integral  control.  We  can  show  that 

using  the  effective  states  e  «  y-r  and  z  =  x.  ,.-x.  and  controls  v,  ■  u.  .-it  then,  if  rv+i“rv  =  0 
[a  constant  or  step  input]  the  equation  is  e  1  K 


0  F  J  z, 


From  Eq.  (10)  a  control  law  is  generated  as 


-K  e  -  Kz 
o 


and  the  Kq  in  (11)  is  the  parameter  that  appears  in  the  realization  (6). 

The  organization  of  the  computer  programs  of  DIGICON/85  is  as  a  set  of  separate  programs  to  perform 
different  parts  of  the  task  of  the  design  process  and  in  each  case  the  data  for  the  particular  task  Is 
obtained  either  from  the  keyboard  or  from  a  data  file  on  the  machine  tape  cartridge.  The  data  structure  is 
a  sequential  file  so  we  must  define  beforehand  the  exact  sequence  of  data  to  be  stored,  for  DIGICON  we  have 
two  sequences,  depending  on  whether  the  data  represents  a  process  (including  perhaps  integral  control)  or  a 
controller  (which  will  include  feedforward  zeros.)  The  process  data  the  sequence  is 

N1,N2,N3,N4,N5,T,L1,Z1,F,G,H,J,A,R9,B,R8,K,L 

and  for  controller  data,  the  sequence  is  identical  to  R8  but  is  then  different,  as  follows: 

N1,N2,N3,N4,N4,T,L1,Z1,F,G,H,J,A,R9,B,R8,C,R7,M,N 
The  meaning  of  these  data  is  as  follows: 

N1  the  number  of  states 

N2  the  number  of  Inputs 

N3  the  number  of  outputs 

N4  the  number  of  finite  zeros 

N5  the  number  of  finite  feedforward  zeros  in  a  controller  (set  to  zero  in  a  file  of  process  data) 

T  sampling  period  (set  to  zero  in  a  file  of  data  for  a  continuous  (non-discrete)  model) 

LI  pure  time  delay  in  the  model 

Z1  an  indicator  or  switch  variable  to  identify  the  data  type 

Z1  =  0  implies  a  file  of  process  data  with  no  extra  integral  action  included 

Z1  =  1  implies  a  file  of  process  date  with  integral  action 

Z1  *  2  Implies  a  file  of  controller  data 

F  process  or  controller  state  feedback  matrix  N1  x  N1 

G  process  or  controller  input  matrix  N1  x  N2 

H  process  or  controller  output  matrix  N3  x  N1 

J  process  or  controller  direct  transmission  matrix  N3  x  N2 

n 

A  matrix  coefficients  of  system  characteristic  polynomial.  If  the  polynomial  is  J  a  z  then 

A(I)  -  a  ^  A  is  N1  +  1  x  l  1=0 

R9  matrix  of  system  poles,  the  roots  to  a(z)  «  0.  R9  is  N1  *  2  and  the  first  column  holds  the 

real  parts  and  the  second  column  holds  the  corresponding  imaginary  parts 

B  matrix  of  coefficients  of  system  zero  polynomial  N1  +  1  x  1  stored  as  in  A 

R8  matrix  like  R9  but  holding  the  system  zeros.  R8  is  S4  k  2 

K  Control  law 

L  Estimator  law 

C  matrix  of  coefficients  of  the  feedforward  zeros  polynomial  of  a  controller.  Elements  stored  as 

in  A  and  B 


It  is  recognized  that  (6)  does  not  represent  the  most  general  case  since  terms  Nlr  could  be  added  to  (6a) 
and  Mlf  could  be  added  to  (6b).  These  terms  would  change  the  location  of  the  feedforward  zeros. 


R7 


matrix  of  roots  of  feedforward  zeros  polynomial,  stored  as  in  R8  and  R9.  R7  is  N5  *  2 


M  matrix  of  controller  feedforward  as  in  equations  (5)  and  (6).  SI  *  1 

N  matrix  of  controller  direct  transmission  as  in  (5)  and  (6) 

To  establish,  manipulate  and  produce  a  design  from  the  data  base  there  are  in  DIGICON  eight  programs. 
Brief  descriptions  of  these  programs  follow. 

IX.  Program  Descriptions 

-Xs 

ROOTLC :  This  is  a  program  to  plot  the  root  locus  of  the  equation  a(z)  +  Kb(z)  =  0  or  a(s)  +  Ke  b(s)  =  0 
versus  K.  The  data  for  a  and  b  may  be  obtained  either  from  the  keyboard,  a  file,  or  a  set  of  DATA  state¬ 
ments  contained  in  the  program.  The  program  computes  points  on  the  locus  by  searching  for  points  when  the 
phase  of  b/a  is  180°  and  K.  is  increasing.  The  user  specifies  a  starting  point  (typically  a  root  of 
a  “  0,  a  pole)  and  also  specifies  a  distance  between  points.  If  the  distance  is  too  large  the  program  will 
not  converge  to  a  valid  point  and  a  new  distance  must  be  tried.  No  provision  for  marking  points  of  a  specific 
gain  are  made  but  the  user  is  given  a  tabulation  of  root  locations  and  gain  values  from  which  points  of 
interest  can  be  marked. 

The  program  will  plot  the  results  in  either  the  s-plane  or  the  z-plane. 

INPUT :  This  is  a  program  to  build  a  file  of  data  describing  either  a  process  or  a  controller.  The  data  Is 
to  be  provided  from  the  keyboard  iji  state  variable  form.  The  program  converts  the  state  description  to 
observer  canonical  form  and  thus  obtains  the  characteristic  polynomial  and  the  zeros  polynomial.  These 
polynomials  are  solved  and  the  roots  obtained.  Program  INPUT  Initializes  K  and  L  for  a  process  data 
file  to  zero  and  stores  all  this  data  in  a  file  named  by  the  user.  A  convenient  mnemonic  for  flies  Is  to 
use  a  suffix  to  indicate  file  type.  For  example,  the  double  integrator  problem  might  have  associated  with 
it  the  three  files  S2.CS  for  the  continuous  process  data  (.CS  for  continuous);  S2.DE  for  the  discrete  model 
data  and  S2.C0  for  the  corresponding  controller. 

READ:  This  is  a  program  to  read  a  file,  print  a  copy  of  its  contents  and  make  another  copy  on  another  tape, 
if  desired. 

SAMPLfl :  This  is  a  program  to  compute  the  discrete  equation  matrices  <P  ,  T,  HI,  J1  from  a  file  of  data 
for  a  continuous  model.  The  program  includes  the  capability  to  have  a  pure  time  delay  in  the  model  and  to 
include  integral  control.  In  the  latter  case,  the  discrete  equations  are  in  the  error  space  form  of 
equation  (10).  The  algorithm  is  given  in  Franklin  and  Powell,  pages  171-177. 

POLY :  This  is  a  program  to  solve  for  the  roots  of  a  polynomial.  The  program  is  an  adaptation  of  the  standard 
pac  program  included  with  the  HP-85. 

CONLAW :  This  is  a  program  to  compute  K,  the  control  law,  by  "pole  placement"  so  that  the  closed  loop  system 

Vi  -  (F-GK)xk  (12> 

will  have  a  specified  characteristic  equation.  The  desired  poles  are  specified  by  the  user  and  the  gains, 

K,  are  computed.  A  transient  response  of  (12)  is  computed  to  aid  in  the  evaluation  of  a  selected  set  of 
poles. 

REDEST:  This  program  computes  a  Reduced  order  Estimator  for  a  given  process.  The  design  is  based  on  pole 
placement  of  the  estimator  poles  and  includes  feedforward  to  guarantee  that  the  reference  input  does  not 
excite  these.  The  program  allows  for  Integral  control  and  produces  matrices  for  either  equation  (5)  or  (6) 
as  is  appropriate. 

REST:  This  is  a  program  to  compute  the  RESPonse  of  a  discrete  system  using  either  equation  (5)  or  (6)  as 
the  controller,  and  (1)  for  the  process.  The  disturbance  w^  and  reference  r^  are  constants  as  step 
functions. 
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Summary 

Computerized  synthesis  techniques  of  modern  control  theory  are  in  widespread  use,  but  a 
number  of  fundamental  design  problems  still  remain.  We  call  them:  the  design  specifica¬ 
tions  problem,  the  free  design  parameter  problem,  the  plant  complexity  versus  controller 
simplicity  problem  and  the  dirty  design  environment  problem.  A  design  procedure  which 
comes  close  to  solving  these  design  problems  is  recommended:  It  is  an  iterative  design 
technique  using  a  performance  index  vector  which  provides  a  systematic  guidance  for  the 
designer  to  take  care  of  multiple  design  objectives  simultaneously  and  individually.  As 
a  design  tool  unconstrained  parameter  optimization  is  used.  A  practical  application  is 
briefly  reported:  The  design  of  a  robust  control  loop  for  a  fighter  aircraft  where  42 
performance  criteria  of  9  different  sorts  have  been  considered  simultaneously. 


1  . _ Introduction 


The  ultimately  achievable  performance  of  a  control  system  is  limited  by  hardware  con¬ 
straints  such  as  the  dynamics  of  the  plant,  the  actuators,  and  the  sensors.  The  actually 
achieved  performance  is,  in  addition,  a  product  of  control  design:  The  specification  of 
a  controller  structure  and  the  tuning  of  the  controller  parameters  determine  how  well  the 
design  objectives  are  met  within  the  given  limitations.  Hence,  in  order  to  improve  con¬ 
trol  performance  one  should  improve  the  control  design  as  well  as  the  underlying  design 
procedures . 

Design  is  a  trade-off  between  various  competing  objectives.  These  objectives  reflect 
the  properties  a  good  system  should  satisfy.  They  are  rarely  complete  and  quantitatively 
specified  at  the  outset.  Rather  they  are  an  open  list  and  formulated  in  qualitative  terms 
such  as:  The  control  system  should  be 

-  "fast  and  smooth"  in  response  to  reference  inputs,  there  should  be 

-  "small  static  and  dynamic  errors"  due  to  disturbances.  The  system  should  have  a 

-  "good  stability  margin"  in  order  to 

-  "tolerate  'large'  parameter  variations  and  modelling  inaccuracies."  But  the  system 

should  also  have  a 

-  "well  limited  bandwidth"  and  "low  feedback  gains"  in  order  to  be 

-  "insensitive  with  respect  to  measurement  noise".  Furthermore  control  action  should  be 

-  "well  within  actuator  saturation  limits"  and  so  on  ... 

The  designer  has  to  satisfy  all  these  objectives  in  the  best  possible  way.  How  can 
this  be  achieved? 


2.  Problems  when  Using  Synthesis  Techniques  of  Modern  Control  Theory 

Modern  control  theory  using  state  feedback  and  observers  e.g.  for  linear  optimal  con- 
trol,  pole  placement,  decoupling  and  disturbance  accomodation,  provides  much  insight  into 
the  analysis  and  synthesis  of  linear  systems.  Combining  this  conceptual  insight  with  the 
numerical  efficiency  of  a  digital  computer  results  in  a  set  of  useful  tools  to  assist  the 
design  process.  But  although  such  computerized  control  synthesis  techniques  are  in  wide¬ 
spread  use,  a  number  of  fundamental  design  problems  still  remain: 

The  Design  Specifications  Problem:  A  synthesis  technique  of  modern  control  theory 
usually  focuses  on  one  type  of  design  specification  only  (e.g.  pole  location;  decoupling; 
disturbance  accomodation)  and  is  based  on  an  exact  numerical  specification  (e.g.  all  pole 
positions,  exact  decoupling,  exact  asymptotic  disturbance  accomodation  for  prespecified 
frequencies).  This  requests  the  designer  to  map  all  design  objectives  onto  a  single  type 
of  mathematical  criterion  (e.g.  pole  placement)  which  often  narrows  too  much  the  designer's 
overall  view  of  system  performance.  Furthermore  an  exact  quantitative  specification  of 
design  objectives  is  rarely  possible  in  practice.  What  about  a  combination  of  different 
types  of  criteria  to  cover  the  overall  system  performance?  What  about  specifying  bounds 
rather  than  specifying  exact  numerical  values  for  performance  measures? 

The  Free  Design  Parameter  Problem:  The  free  design  parameters  in  the  Riccati  formalism 
are  the  weighting  coefficients  in  the  cost  functional.  How  to  choose  these  weighting  coef¬ 
ficients  systematically?  How  to  choose  pole  locations  as  the  free  design  parameters  in  a 
pole  placement  approach?  What  about  additional  free  parameters  in,  say,  a  full  order  state 
observer?  To  solve  these  decision  problems,  an  iterative  design  loop  is  necessary  based 
on  an  analysis  of  the  overall  system  performance  and  notably  the  plant  limitations.  Is 
there  a  strategy  available  for  systematically  reaching  an  overall  compromise? 


m r 
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The  Plant  Complexity  vs.  Controller  Simplicity  Problem:  Modern  control  theory  provides 
analytical  relations  between  the  structure  of  the  plant  and  the  structure  of  a  suitable 
controller.  This  means:  Simple  plant  models  yield  simple  controllers,  complex  plant  models 
yield  complex  controllers.  What  about  simple  controllers  for  complex  plants  which,  as  we 
know,  often  do  very  well  in  practice?  We  need  an  efficient  design  procedure  for  simple 
controllers  taking  into  account  realistic  and  thereby  rather  complex  plant  models.  Why 
not  use  parameter  optimization? 

The  Dirty  Design  Environment  Problem:  Modern  control  theory  yields  neat  solutions  for 
neat  problems.  Unfortunately,  the  design  environment  in  practice  is  rather  dirty.  What 
about  plant  nonlinearities,  parameter  uncertainties,  changing  characteristics  and  control 
saturation?  Can  we  handle  such  effects  directly  without  the  need  to  transcribe  them  into 
the  lineal  framework  of  modern  control  synthesis? 

These  design  problems  when  using  synthesis  techniques  of  modern  control  theory  are 
conceptual  ones:  They  are  consequences  of  a  search  for  analytical  solutions,  where  the 
computer  serves  only  the  purpose  of  fast  numerical  evaluations  of  (complex)  analytical 
relations.  They  cannot  be  removed  by  using  faster  computers  or  by  improving  the  man- 
computer  interface.  What  we  need  for  control  design  is  a  multi-objective  design  procedure 
with  a  systematic  decision  strategy  as  a  complement  to  control  theory  and  the  use  of 
computers  as  fast  searching  devices  for  finding  a  "best"  solution  in  a  (nonlinearly 
defined)  set  of  possible  candidates. 


In  order  to  handle  multiple  design  objectives  in 
formulated  by  mathematical  criteria.  This  is  a  quest 
the  state  space  or  the  frequency  domain  may  turn  out 
fying  a  particular  design  objective.  Control  theory 
design  criteria  have  to  be  included  for  a  special  de 
itative  insight  into  conflicting  demands.  But  contro 
design  as  a  multiple-criteria  decision  problem  as  it 
tude  of  different  design  objectives  which  have  to  be 
a  systematic  framework  which  guides  the  designer  to 
design  problem. 


a  systematic  way,  they  have  to  be 
ion  of  control  theory,  where  either 
to  be  more  appropriate  for  quanti- 
also  helps  to  decide  what  kind  of 
sign  purpose  and  it  yields  some  qual- 
1  theory  yields  no  strategy  to  handle 
naturally  results  out  of  the  multi- 
considered.  Hence  we  are  looking  for 
cope  with  a  multiple-criteria  control 


3.  Systematic  Design  Via  a  Performance  Index  Vector  and  Parameter  Optimization 


A  design  procedure  which  comes  close  to  solve  most  of  the  above  mentioned  design  prob¬ 
lems  has  been  suggested  in  [I].  It  has  proven  to  be  very  useful  in  various  practical 
designs  and  can  be  summarized  as  follows: 

Given  a  plant,  and  given  a  suitably  chosen  controller  structure  with  free  parameters 
k  =  [kj,...,k  ]  the  design  problem  is  to  determine  suitable  values  for  k. 

In  order  that  the  design  can  proceed  in  a  systematic  fashion  it  is  necessary  that  all 
design  objectives  are  taken  care  of  explicitly  in  the  design.  Therefore,  every  design 
objective  shall  be  rated  quantitatively  by  means  of  a  suitable  performance  index  J^(k). 

A  performance  index  J^(k)  is  called  suitable  if  J ^ (k ’ )  <  J^(k")  means  that  k'  satisfies 

the  i-th  control  objective  better  than  k"  does,  and  if  J-(k)  is  a  sufficiently  smooth 
function. 

For  ease  of  notation  we  define  the  performance  vector  J(k)  =  [  J-]  (k) , . . . ,  JL  (k)  ] . 

Moreover,  for  any  two  real  vectors  x,  y  we  say  that  x  <  y  if  for  all  components  x,  <  y, 
and  x  f  Z.  “  ~  i-i 

Iterative  Technique  for  a  Systematic  Design 

The  design  technique  is  iterative,  where  each  design  iteration  (the  v-th,  say)  com¬ 
prises  two  steps: 

Step  1 :  Choose  a  vector  of  design  parameters  cv  (c^v  >  o)  such  that 

J(kv_1)  <  c  <  cv_1 .  (1 

Step  2:  Find  kv  and  aQ  <  1  such  that 

J(kv)  <  a0cv.  (2 


The  design  iteration  is  initialized  by  using  an  initial  guess  k°  for  the  controller 
parameters  and  by  taking  c°  sufficiently  large. 

In  step  1  of  every  design  iteration,  cv  has  to  be  chosen  where  (1)  provides  a  well 
defined  margin.  This  choice  determines  tKe  design  direction.  Step  2  then  provides  the 
margin  for  the  next  oesign  iteration. 

It  is  desirable  that  this  margin  be  as  large  as  possible,  i.e.  a  should  be  as  small 
as  possible.  ° 


(3) 


After  the  v-th  design  iteration  we  have  from  (1),  (2)  that 

J(kv)  <  cv  <  cv_1  <  ...  <  c°, 

i.e.  we  have  a  monotonically  decreasing  sequence  of  design  vectors  and  the  performance 
vector  has  become  less  than  all  of  them.  This  establishes  the  systematic  behaviour  of 
the  design  prpcess. 

Design  Tool 

Basically,  any  method  which  is  able  to  perform  step  2  of  the  design  iterations  can 
serve  as  a  design  tool.  Here  we  shall  focus  on  a  method  which,  in  addition,  tends  to 
make  as  small  as  possible. 

The  smallest  value  of  aQ  satisfying  J(k)  <  aQcv  is  given  (as  a  function  of  k)  by 

ot0(k)  ■  max  (J.  (k)/c- v) .  (4) 

0  "  ISiSL  1  1 


Since  a  (k)  is  not  continuously  differentiable  everywhere,  we  consider  a  smooth,  i.e. 
at  least  0  Twice  continuously  differentiable  approximation  a(k)  instead,  where 


o(k) 


i  ln 


E  exp{pJi(k)/civ} 


i«1 


(5) 


and  p  >  0  is  arbitrary.  It  can  be  verified  that 


,  L  J. (k) 

a(k)  =  a  (k)  ♦  ^  ln  E  exp{p[-i— - a  (k)]}- 

°  -  P  i=i  c-v  0  " 


(6) 


Hence  0  S  a  -  aQ  S  (In  L)/p,  i.e.  a  can  be  made  as  close  to  aQ  as  desired  by  choice  of  p. 


The  minimization  of  a(k)  (for  example,  by  applying  Powell's  method  for  function  mini¬ 
mization  without  calculating  derivatives  [2])  can  now  be  used  as  a  design  tool  for  per¬ 
forming  step  2  in  each  design  iteration. 


The  sequence  of  design  iterations  finally  terminates  when  the  minimization  of  a(k) 
results  in  a  kv  with  aQ(kv)  2  1. 

As  in  all  parameter  optimization  problems  there  is  a  possibility  of  local  minima. 
Therefore,  instead  of  using  efficient  local  optimization  algorithms  such  as  Powell's  [2], 
one  may  as  well  use  global  optimization  algorithms  such  as  random  search  algorithms. 
However,  the  latter  are  known  to  be  less  efficient. 

Note  also  that  the  use  of  optimization  algorithms  which  assume  that  the  function  to  be 
minimized  is  twice  continuously  differentiable  (which  most  efficient  local  algorithms  do) , 
requires  that  the  performance  indices  themselves  must  be  twice  continuously  differentiable 
This  imposes  a  certain  restriction  on  the  mathematical  formulation  of  performance  indices. 


4.  Practical  Application  Considerations 

The  above  design  procedure  using  a  performance  index  vector  and  parameter  optimization 
solves  most  of  the  design  problems  stated  in  section  2: 

The  design  procedure  provides  a  systematic  framework  to  take  care  of  multiple  design 
objectives  simultaneously  and  individually.  The  design  objectives  car  be  formulated  by 
criteria  of  different  kind  either  in  state  space  or  frequency  domain.  Hence  the  design 
specifications  problem  can  be  tackled  in  the  most  direct  way.  However,  the  design  proce¬ 
dure  does  not  provide  a  priori  guidelines  what  design  objectives  are  appropriate  for  a 
particular  design  problem.  It  remains  in  the  designer's  responsibility  to  decide  what 
design  criteria  he  shall  use.  This  decision  must  be  based  on  his  knowledge  of  control 
theory  and  the  operational  requirements  he  has  to  satisfy. 

The  design  procedure  provides  a  systematic  guidance  for  the  designer  to  cope  with  the 
free  design  parameter  problem:  For  each  individual  perfomance  index  there  is  associ¬ 
ated  a  free  design  parameter  c^  which  can  be  chosen  by  the  designer  within  a  well-defined 

margin.  Typically,  one  starts  with  sufficiently  large  values  for  the  design  parameters 
and  then  successively  reduces  them  so  as  to  improve  certain  criteria  while  keeping  possi¬ 
ble  deteriorations  of  others  in  tolerable  bounds.  Whatever  choice  c^v  in  the  given  margin 
is  taken,  it  is  always  guaranteed  that  S  c^v  and  hence  a  possible  degradation  of  the 
performance  index  is  well  bounded.  If  c^v  *  (kv  ')  is  chosen,  then  an  improvement  of 

is  guaranteed  provided  only  that  such  an  improvement  is  possible  at  all  at  the  expense 

of  degrading  other  performance  indices.  This  allows  to  explore  the  design  possibilities, 
and  to  reach  a  desirable  trade-off  between  competing  objectives,  step  by  step. 
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In  each  step  it  is  the  responsibility  of  the  designer  to  decide  upon  the  design  di¬ 
rection,  i.e.  which  performance  index  shall  be  improved  and  what  is  a  permissible  expense 
in  degrading  other  performance  indices.  The  design  procedure  then  guarantees  a  step  by 
step  monotonic  improvement  in  the  direction  specified  by  the  designer.  The  admissible 
margins  of  the  design  parameters  which  are  updated  in  each  design  iteration,  yield  some 
information  how  easy  or  difficult  it  is  to  reach  the  individual  design  objectives.  Hence 
this  design  procedure  makes  it  possible  to  explore  the  system's  limitations  in  control 
design. 

The  design  approach  is  well  suited  to  deal  with  practical  controller  realization  con¬ 
straints.  This  is  a  consequence  of  using  parameter  optimization  as  a  design  tool  instead 
of  using  an  analytical  control  synthesis  technique.  There  is  no  "controller  simplicity 
vs.  plant  complexity  problem"  as  we  called  it  earlier.  The  designer  can  specify  the  con¬ 
troller  structure  at  will.  He  is  free  to  combine  control  considerations  with  technical 
realizability  constraints.  This  freedom,  however,  calls  much  at  the  designer's  experience 
and  physical  insight  into  the  type  of  system  to  be  controlled.  Furthermore,  in  view  of 
the  numerical  convergence  properties  of  parameter  optimization,  one  also  has  to  take  some 
care  in  mathematically  specifying  the  controller  structure,  i.e.  in  parameterizing  the 
control  law:  There  must  not  be  redundant  parameters.  In  addition,  the  free  parameter  to 
be  optimized  should  have  the  same  order  of  magnitude,  i.e.  proper  scaling  may  be  necessary 
On  the  other  hand,  the  freedom  to  specify  a  controller  structure  makes  it  possible  to 
develop  a  proper  structure  in  an  iterative  process:  Start  with  a  simple  structure,  say  a 
P-I  controller,  and  extend  it  successively  by  additional  dynamic  degrees  of  freedom,  i.e. 
use  a  higher  order  controller,  if  the  design  iterations  yield  no  further  improvement  with 
the  formerly  chosen  structure.  In  this  way,  one  can  explore  the  trade-offs  between  con¬ 
troller  simplicity  and  control  system  performance. 


The  design  procedure  is  open  to  handle  complex,  nonlinear  systems.  There  is  no  con¬ 
ceptual  necessity  for  linearized  or  low  order  plant  models.  But  there  may  be  computer 
time  limitations:  In  view  of  parameter  optimization  the  performance  indices  have  to  be 
evaluated  very  often  and  hence  sufficiently  fast.  For  nonlinear  systems  there  are  usually 
no  analytical  relations  for  such  evaluations,  rather  the  performance  indices  have  to  be 
computed  on  the  basis  of  system  simulations.  This  limits  the  complexity  of  system  models 
to  be  used  by  the  speed  of  available  computers  and  the  efficiency  of  available  simulation 
software. 

For  practical  applications  of  the  design  procedure  via  performance  index  vector  and 
parameter  optimization,  a  user-oriented,  modular  design  software  package  REMVG  [3]  is 
available.  This  design  package  has  been  successfully  applied  in  solving  various  non¬ 
standard  design  problems.  One  such  application  is  briefly  described  in  the  next  section. 


5.  Application  Example:  Robust  Control  Loop  Design  for  a  Fighter  Aircraft 

This  application  is  well  documented  in  [4],  Here  we  shall  give  a  brief  overview  of  the 
problem  to  be  handled,  the  sort  and  number  of  performance  indices  used  and  the  final 
result. 

Control  problem:  For  a  fighter  aircraft  (type  F-4C)  a  stability  augmentation  system 
shall  be  designed  to  improve  the  longitudinal  handling  qualities.  This  shall  be  achieved 
with  a  fixed  gain  controller  which  covers  the  whole  flight  envelope  without  gain  sched¬ 
uling.  Furthermore  only  pitch  rate  (0) ,  the  variable  to  be  controlled,  shall  be  used  for 
feedback.  As  structure  of  the  controller,  a  third-order  compensator  has  been  specified, 
where  ten  constant  controller  parameters  kj  are  to  be  assigned  (figure  1).  The  longitu¬ 
dinal  motion  of  the  aircraft  is  modeled  by  a  linearized  second  order  short  period  motion 
description  of  the  aircraft  plus  a  first  order  actuator  system.  In  the  flight  evelope, 
five  (extreme)  flight  conditions  shall  be  considered  (figure  2). 

Performance  Criteria:  The  following  performance  criteria  are  specified  for  each  of  the 
five  selected  flight  conditions. 

lE§i5££2Ey_2Ei£5Ei§_f2I.§iSE-E5§E2D§5 : 
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Ji(k)  “  5  [0i(t)/O.(»)  -  Gm(oi-t)J^dt  (i  =  1  ....  5) 

o 

Here  0m  is  a  desired,  normalized  step  response  modeled  by  a  second-order  system,  denote 
different  time  scales  for  each  individual  flight  condition. 


Ti 

2)  J5  +  i  (k)  =  /  lni(t)/ni  (“»)  ]2dt 


(i  =  1  - - S) 


Traje S52EY_2EiE®Ei§_£2I-disturbance_re2ectioiu 


T . 


^  jio+i (y  = ;  ®i(o  dt 


(i  =  1....5) 


£i 


Here  denote  the  desired  settling  times,  after  which  the  pitch  rate  regulation 
error  is  to  be  uniformly  less  in  magnitude  than  S  per  cent  of  the  initial  perturbation. 


V 


4)  Jis+i(y  =  /  [^cordt 
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Here  is  the  associated  rate  of  elevator  motion  for  disturbance  rejection. 

To  guarantee  a  desired  degree  of  damping: 

5)  J20+i(y  =  [Im(X)/Re(X)J^  (i  =  1 ,  —  S) 

To  guarantee  a  desired  degree  of  absolute  stability: 

6)  J2S*i^  =  exP  {[RetA^£x}  (i  =  1.-..5) 

To  limit  the  maximum  eigenf requency  in  order  to  avoid  structural  mode  excitation: 

7>  Jjo.i®  - 


£25£E9iiSE_92SffiSitO£§-$Ei£®li§: 

To  bound  the  controller  eigenvalues: 

8)  J35+j(k)  =  1/ikjl 

To  limit  feedback  gains: 

9)  J3S*j(y  =  |kjl- 


(j  =  1,2,3) 

(j  =  4, ...7) 


Hence  in  this  control  design,  9  different  sorts  of  performance  criteria  are  used. 
The  total  number  of  performance  criteria  is  42.  The  number  of  controller  parameters 
to  be  designed  is  10. 

Result : 


The  final  design  result  is  shown  in  figures  3  and  4:  It  was  possible  to  design  a  fixed 
gain  controller  which  covers  the  entire  flight  envelope  without  gain  scheduling  and 
which  uses  pitch  rate  feedback  only.  This  novel  result  has  been  achieved  in  about  20 
design  steps,  where  each  design  step  required  about  60  s  computer  time  on  a  AMDAHL 
470/V6  computer.  More  details  and  results  can  be  found  in  [ 4 J . 
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Figure  1  : 

Compensator  structure  (3rd  order) 
Parameters  to  be  designed  k.  . ..k.. 
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Figure  3:  Command  step  response  Q 

Flight  conditions  1  ...  5 


uncontrolled 


controlled 


Figure  4:  Disturbance  step  response  0 
Flight  conditions  1  ...  5 
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SUMMARV 

Practical  problems  associated  with  digital  computer  implementation  control  laws  are 
discussed.  The  key  problem  is  to  convert  a  digital  control  law  in  state  space  or 
polynomial  form  into  a  computer  program  which  gives  the  desired  results.  The  paper 
covers)  sensor  and  actuator  interfaces,  analog  prefiltering.  actuator  saturation, 
anti -windup,  numerics  and  coding. 


1.  INTRODUCTION 

This  paper  deals  with  practical  aspects  on  implementat ion  of  digital  control  laws. 
The  starting  point  is  a  description  of  a  control  algorithm  in  terms  of  a  linear  dynamical 
system  either  in  state  space  form  or  in  transfer  function  form.  A  summary  of  these  form 
is  given  in  Section  2.  Analog  prefiltering  is  a  necessity  when  realising  digital  control 
laws.  This  is  discussed  in  Section  3.  The  consequences  of  the  dynamics  of  the  prefilters 
and  of  the  computational  delay  is  also  covered  in  this  Section.  Although  many  control 
laws  can  be  designed  using  linear  theory  it  is  necessary  to  take  nonlinearities  into 
account  in  the  implementation.  The  special  case  of  actuator  saturation  which  is  very 
common  is  discussed  in  Section  4.  Consequences  of  roundoff  and  finite  word-length  in  the 
calculations  are  discussed  in  Section  A.  A  more  detailed  treatment  of  the  topics  of  this 
paper  is  given  in  Cl 3 . 

2.  DIFFERENT  REPRESENTATIONS  OF  THE  REGULATOR 

Linear  design  methods  give  control  laws  in  the  form  of  linear  dynamical  system.  Such 
systems  can  be  represented  in  many  different  ways. 


Siii*_f**sl&a£!£_si£b_tG_B2‘fili£i£_s&ss£¥f£ 

Pole  placement  or  LOG  design  result  in  a  control  law  of  the  form 


x(k | k>  - 

x <k | k-1 )  +  Ky (k)  -  y(k|k-l)3 

u<k)  - 

LCx  <k)  -  x ( k | k) 3  +  D  u  (k) 
m  c  c 

A 

A 

x (k+1 1 k  >  ■ 

Ax(k | k>  +  Bu(k) 

Cl) 

x  (k+1)  » 

f(x  (k).  u  <k)). 

m 

m  c 

A 

A 

y(k+i|k>  * 

Cx(k+1 |k> 

A 

>  x  is  an 

estimate  of  the  process  state  and  x 

m 

is  the  state  of  the  model  which 

generates  the  desired  response  to  command  signals  u  .  Notice  that  a  nonlinear  model  for 

c 

the  desired  state  may  be  used  in  this  representation. 


QlO«£ll_Sfcfl£C_BlB£BlBD&l&ia& 

If  the  function  f  in  (1)  is  linear  the  regulator  given  by  (1)  is  a  linear  system 
with  the  Inputs  y  and  u  and  the  output  u.  Such  a  regulator  may  always  be  represented  as 

c 

1u<k>  -  Cx<k>  +  Dy  (k)  +  D  u  <k> 

c  c 

x(k+l)  -  FxCk)  ♦  Gu(k)  +  G  u  <k>.  <2> 

c  c 

This  form  is  more  compact  than  <1>.  The  state  may.  however,  not  necessarily  have  a  simple 
physical  interpretation. 
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Several  design  methods  result  in  a  description  of  the  regulator  in  terms  of  a  linear 
transfer  function  model.  The  general  form  of  such  a  model  can  be  written  as 

R(q>  u(k)  *  T(q)  u  (k>  -  S(q)  y(k)*  C3) 

c 

where  R(q)»  SCq)  and  T(q)  are  polynomials  in  the  forward  shift  operator  q. 

There  are  simple  transformations  between  the  different  rep resentat ions  of  the 
regulators. 


t!9S_to_imgiement_a_disc ret e_ time _sy stem 

The  implementation  of  a  discrete  time  system  described  by  (2.1)*  (2.2)  or  (2.3) 
using  a  digital  computer  is  straightforward .  The  computer  code  for  implementation  of  the 
regulator  given  by  equation  (2.2)  is> 

Procedure  Regulate 
begin 

1  Ad in  y  uc 

2  u  »*  C*x  +  D*y  +  Dc  *  uc 

3  x  s  =  F*x  +  G*y  +  Gc  *  uc 

4  Daout  u 
end 

Analog  to  digital  conversion  is  commanded  in  the  first  line.  The  appropriate  values 
are  stored  in  the  arrays  y  and  uc.  The  control  signal  u  is  computed  in  the  second  line 
using  matrix  vector  mult ipl icat ion  and  vector  addition.  The  state  vector  x  is  updated  in 
the  third  line*  and  the  digital  to  analog  conversion  is  performed  on  line  four.  To  obtain 
a  complete  code  it  is  also  necessary  to  have  type  declarations  for  the  vectors  ui  uci  x 
and  y  and  the  matrices  F*  Oi  Get  Ct  D  and  Dc.  It  is  also  necessary  to  assign  values  to 
the  matrices  and  the  initial  value  for  the  state  x.  When  using  computer  languages  which 

do  not  have  matrix  operations  it  is  necessary  to  write  appropriate  procedures  for 

generating  matrix  operations  using  operations  on  scalars. 

The  details  depend  on  the  hardware  and  software  available.  To  show  the  principles  it 

is  assumed  that  the  system  described  by  (2>  should  be  implemented  using  a  digital 

computer  with  A-D  and  D-A  converters  and  a  real  time  clock.  The  execution  of  the  program 
is  controlled  by  the  clock,  which  initiates  the  execution  of  the  code  at  each  clock 
interrupt.  The  sampling  period  is  thus  determined  by  the  time  between  the  clock  pulses. 

It  is  thus  straight  forward  to  implement  a  digital  control  law.  To  obtain  a  good 
control  system  it  is  however  necessary  to  also  consider!  numerics*  sensors*  actuators* 
operational  aspects  and  programming  aspects.  These  will  be  discussed  in  the  following 
sections. 


3.  PREFILTERING  AND  COMPUTATIONAL  DELAY 

To  obtain  a  satisfactory  digital  system  it  is  necessary  to  filter  the  analog  signals 
before  they  are  sampled.  It  is  also  necessary  to  consider  the  dynamics  caused  by  the 
prefilter  and  the  computational  delay. 


60iIea_BrSfiItering 

To  avoid  aliasing  it  is  necessary  to  use  an  analog  prefilter  for  elimination  of 
disturbances  with  frequencies  higher  than  the  Nyquist  frequency  associated  with  the 
sampling  rate.  In  signal  processing  applications  the  analog  prefilter  is  determined 
frequency  content  of  the  signal*  see  C21  and  C31 .  In  a  control  problem  there  is  normally 
much  more  information  available  about  the  signals  in  terms  of  differential  equations  for 
the  process  models  and  possibly  also  for  the  disturbances.  An  analog  Kalman  filter  would 
be  a  very  good  prefilter*  because  it  can  be  based  on  a  detailed  description  of  the 
signal.  There  are  several  advantages  in  implementing  the  Kalman  filter  in  a  computer.  In 
such  a  case  it  is  useful  to  sample  the  analog  signals  at  a  comparatively  high  rate  and  to 
avoid  aliasing  by  an  ordinary  analog  prefilter  designed  from  the  signal  processing  point 
of  view. 


The  bandwidth  w  of  the  prefilter  is  inversely  proportional  to  the  sampling  period 
B 

h.  A  common  rule  of  thumb  is  to  choose  the  sampling  period  so  that  u  h  «  0.5  -  1. 

B 


The  precise  choice  depends  on  the  order  of  the  filter  and  on  the  character  of  the 
measured  signal.  The  dynamics  of  the  prefilter  should  be  taken  into  account  when 
designing  the  system. 
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If  the  sampling  rats  is  changed  the  prefilter  must  also  be  changed.  With  reasonable 
component  values  it  is  possible  to  construct  analog  prefilters  for  sampling  periods 
shorter  than  a  few  seconds.  For. slower  sampling  rates  it  is  often  simpler  to  sample  once 
per  second  or  faster  with  an  appropriate  analog  prefilter  and  apply  digital  filtering  to 
the  sampled  signal.  This  approach  also  makes  it  passible  to  change  the  sampling  period  of 
the  control  calculations  by  software  only. 

Since  the  analog  prefilter  has  dynamics  it  is  necessary  to  include  the  filter 
dynamics  in  the  process  model.  If  the  prefilter  or  the  sampling  rate  is  changed  it  is 
necessary  to  recompute  the  coefficients  of  the  control  law. 

Crude  estimates  indicate  that  with  normal  sampling  ratesi  like  10-20  times  per 
period*  it  is  indeed  necessary  to  consider  the  prefilter  dynamics. 


Qgmputat  iona  i_del.ay 


Since  A-D  and  D-A  conversions  and  computations  take  time*  there  will  always  be  a 
delay  when  a  control  law  is  implemented  using  a  computer.  The  delay*  which  is  called  the 
cgmgutat iona l._del.ay «  will  depend  on  how  the  control  algorithm  is  implemented.  There  are 
basically  two  different  ways  to  do  this.  The  measured  variables  read  at  time  t  may  be 


used  to  compute  the  control  signal  to  be  applied  at  time  t  .  This  is  called  case  A. 

k+1 

Another  possibility*  case  B*  is  to  read  the  measured  variables  at  time  t^  and  to  make  the 
D-A  conversion  as  soon  as  possible. 


The  first  schema  has  the  disadvantage  that  the  control  actions  are  delayed 
unnecessarily  and  second  schame  has  the  disadvantage  that  the  delay  will  be  variable 
depending  upon  the  programming.  In  both  cases  it  is  necessary  to  take  the  computational 
delay  into  account  when  computing  the  control  law.  This  is  easily  done  by  including  a 
time  delay  of  h  or  x  respectively  in  the  process  model.  Anothar  practical  detail  is  that 
there  is  a  good  rule  to  read  the  inputs  before  the  outputs  are  set  out.  If  this  is  not 
done  there  is  always  the  risk  of  electrical  cross  coupling. 

The  computational  delay  can  be  made  as  small  as  possible  by  making  as  few  operations 
as  possible  between  the  A-D  and  D-A  conversions. 

Consider  the  previously  given  program.  Since  the  control  signal  u  is  available  after 
executing  the  second  line  of  code  the  D-A  conversion  can  be  done  before  tha  state  is 
updated.  The  delay  may  be  reduced  further  by  also  calculating  the  product  C*x  after  the 
D-A  conversion.  The  following  algorithm  is  then  obtained. 

Procedure  Regulate 
begin 

1  Ad  in  y  uc 

2  u  l*  ul  +  D*y  +  Dc*uc 

3  Daout  u 

4  x  ««  F*x  +  G*y  ♦  Gc*uc 

5  ul  »«  C*x 

end 


It  is  useful  to  have  good  estimates  of  computing  times  for  different  control 
algorithms.  A  good  way  to  obtain  these  is  to  run  test  programs.  For  linear  control  laws 
it  is  often  possible  to  estimate  times  from  results  of  a  scalar  product  computation. 

On  simple  microcomputers*  which  do  not  have  floating  point  arithmetic  in  hardware* 
there  will  be  a  substantial  difference  in  computing  time  between  fixed  point  and  floating 
point  operations.  The  difference  is  much  less  if  there  is  hardware  for  floating  point 
operations. 

To  judge  the  consequences  of  computational  delays  it  is  also  useful  to  know  the 
sensitivity  of  the  closed  loop  system  with  respect  to  a  time  delay.  This  may  be  evaluated 
from  a  root  locus  with  respect  to  a  time  delay.  A  simpler  way  is  to  evaluate  how  much  the 
closed  loop  poles  change  when  a  time  delay  of  one  sampling  period  is  introduced. 

Ctfc8sUgQ-9£-Qu£liaii-ftQd-<Diiiui«(D*Q£-s>*l£uac&isa8 

Linear  filtering  theory  is  very  useful  to  reduce  the  influence  of  measurement  noise. 
There  may*  however*  also  be  other  types  of  errors  like  instrument  malfunctions  and 
conversion  errors.  These  are  typically  characterized  by  large  deviations  which  occur  with 
low  probabilities.  It  is*  of  course*  very  important  to  try  to  eliminate  such  errors  so 
that  large  errors  do  not  enter  into  the  control  law  calculations.  There  are  many  good 
ways  to  achieve  this  when  using  computer  control. 

The  errors  may  be  detected  at  the  source.  In  systems  with  high  reliability 
requirements  this  is  done  by  duplication  of  the  sensors.  Two  sensors  are  then  combined 
with  a  simple  logic*  which  gives  an  alarm  if  the  difference  between  the  sensor  signals  is 
larger  than  a  threshold.  A  pair  of  redundant  sensors  may  be  regarded  as  one  sensor*  which 
either  gives  a  reliable  measurement  or  a  signal  that  it  does  not  work. 
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In  more  extreme  cam  three  aaniori  may  ba  usad.  A  measurement  is  then  accepted  as 
long  as  two  out  of  the  three  sensors  agree  (two-out-of-three  logic).  It  is  of  course  also 
passible  to  use  even  more  elaborate  combinations  of  sensors  and  filters. 

It  is  also  possible  to  use  a  Kalman  filter  for  error  detection.  Consider  for  example 
the  control  algorithm  (1)  with  an  explicit  observer.  The  one  step  prediction  error 

c(k)  ■  y  <k>  -  y  <k  | k-1 >  -  y(k)  -  Cx(k|k-i> 

appears  explicitly  in  the  algorithm.  If  estimates  of  the  covariance  matrix  of  the 
prediction  error  are  available  it  is  easy  to  test  if  a  particular  measurement  is 
reasonable*  see  C43 . 

One  possibility  to  obtain  the  error  covariance  is  to  update  the  covariance  equation 
of  the  Kalman  filter  on  line. 

Kalman  filters  and  redundant  sensors  pairs  may  also  be  combined.  If  measurement 
errors  are  checked  in  this  way  it  is  possible  to  obtain  a  very  flexible  system.  The 
scheme  should  be  augmented  with  tests  to  ensure  observability.  It  is  thus  possible  to 
obtain  a  system  which  can  provide  diagnosis  of  sensor  errors. 

Notice  that  the  possibilities  of  making  these  types  of  test  depend  crucially  on  the 
fact  that  the  representation  of  the  control  law  (1)  with  an  explicit  observer  is  used. 

In  computer  control  there  are  also  may  other  possibilities  to  detect  different  types 
of  hardware  and  software  errors.  A  few  extra  channels  in  the  A-D  converter*  which  are 
connected  to  fixed  voltages*  may  be  used  for  testing  and  calibration.  By  connecting  a  D-A 
channel  to  an  A-D  channel  the  D-A  converter  may  also  be  tested  and  calibrated.  The 
computer  may  be  checked  by  performing  calculations  calculations  whose  results  are  known 
and  compare  the  results  with  the  known  values. 


A.  NONLINEAR  ACTUATORS 

Although  linear  theory  has  a  wide  applicability  there  are  often  some  nonlinearities 
which  must  be  taken  into  account.  Actuators  often  have  a  saturation  characteristics.  This 
nonlinearity  may  be  important  when  large  changes  are  made.  There  may  be  difficulties  with 
the  control  system  during  start  up  and  shut  down  as  well  as  during  large  changes  if  the 
nonl inearities  are  not  considered. 

The  rational  way  to  deal  with  the  saturation  is  to  develop  a  design  theory  which 
takes  the  nonlinearity  into  account.  This  can  be  done  using  optimal  control  theory.  Such 
a  design  method  is*  however*  quite  complicated.  The  corresponding  control  law  is  also 
complex.  It  is  therefore  practical  to  use  simple  heuristic  methods. 

The  reason  for  the  difficulties  is  that  the  regulator  is  a  dynamical  system.  Uhen 
the  control  variable  saturates  it  is  necessary  to  make  sure  that  the  state  of  the 
regulator  behaves  properly.  Different  ways  of  achieving  this  are  discussed  below. 

§tate_seace_regulatgrs_with_an_exeIicit_ob3eryer 

Consider  first  the  case  when  the  control  law  is  described  as  an  observer  combined 
with  a  state  feedback  (1).  The  regulator  is  thus  a  dynamical  system  whose  state  is 

represented  by  the  estimated  state  *  in  (1).  In  this  case  it  is  straightforward  to  see 
how  the  difficulties  with  the  saturation  may  be  avoided. 

The  estimator  (1)  will  give  the  correct  estimate  if  the  variable  u  in  (1)  is  chosen 

as  the  actual  control  variable  u  .  If  the  variable  u  is  measured  the  estimate  given  by 

P  P 

(1)  and  the  state  of  the  regulator  are  thus  correct  even  if  the  control  variable 

saturates.  If  the  actuator  output  is  not  measured  it  can  be  estimated  provided  that  the 

nonlinear  characteristics  is  known.  For  the  case  of  a  simple  saturation  the  control  law 
can  thus  be  written  as 


CA  -  KC3  x(k-l | k-1 )  +  Bu  Ck-1) 


sat  u 


’u  low 


u  S  ulow 

ulow  <  u  <  uhigh 

u  >  uhigh 


u 

.uhigh 


for  a  scalar  and 


sat  u 


'sat  u 

1 

sat  u 

2 


sat 


u 


n 


for  a  vector.  The  values  ulow  and  uhigh  are  chosen  to  correspond  to  the  actuator 
,  limitations.  Notice  that  even  if  the  transfer  function  from  y  to  u  for  (1)  is  unstable 

the  state  of  the  system  (4)  will  always  be  bounded  if  the  matrix  A-KC  is  stable.  It  is 

A 

also  clear  that  x  will  be  a  good  estimate  of  the  process  state  even  if  the  value 
saturates  provided  that  ulow  and  uhigh  are  chosen  properly. 

The_general._state_seace_mgde.l 

i 

The  regulator  may  also  be 

xCk+1)  =  F  x (k)  +  G  y<k) 

j  u(k)  =  C  x  <k)  +  D  y <k) 

which  does  not  include  an 
for  simplicity.  If  the  matrix 
variable  saturates  it  is  clear 
is  at  its  limit  and  there  is 
than  continue  to  grow  although 
saturation. 

To  avoid  the  difficulty  it  is  desirable  to  make  sure  that  the  state  of  (5)  assumes 
the  proper  value  when  the  control  variable  saturates.  In  conventional  process  controllers 
this  is  accomplished  by  introducing  a  special  t rack ing_mo^e  which  makes  sure  that  the 
state  of  the  system  corresponds  to  the  input  output  sequence  <u  (k>»  y(k>>.  The  design  of 

P 

a  tracking  mode  may  be  formulated  as  an  observer  problem.  In  the  case  of  state  feedback 
with  an  explicit  observer  the  tracking  is  done  automatically  by  providing  the  observer 

A 

with  the  actuator  output  u  or  its  estimate  u  .  In  the  regulator  given  by  (5)  and  (6) 

P  P 


specified  as  a  state  space  model  of  the  form  (2> 

C5> 

(6) 

explicit  observer.  The  command  signals  have  been  neglected 
F  has  eigenvalues  outside  the  unit  disc  and  the  control 
that  windup  may  occur.  Assume  for  example  that  the  output 
a  control  error  y.  The  state  and  the  control  signal  will 
the  influence  on  the  process  is  restricted  because  of  the 


there  is  no  explicit  observer.  To  get  a  regulator  which  avoids  the  windup  problem  the 
solution  for  the  regulator  with  an  explicit  observer  will  be  imitated.  The  control  law  is 
first  rewritten  as  indicated  in  Fig.  1.  The  systems  in  a)  and  b)  have  the  same 
input-output  relation.  The  system  S  is  also  stable.  By  introducing  a  saturation  in  the 

B 

feedback  loop  in  b)  the  state  of  the  system  S  is  always  bounded  if  y  and  u  are  bounded. 

B 

This  argument  may  formally  be  expressed  as  follows.  Multiply  (&)  by  K  and  add  to  (5) . 
This  gives 


x<k+l)  -  F  x<k'  +  G  y(k>  +  KCu<k)  -  C  x<k>  -  D  yCk)3 


■  CF-KC1  x (k )  +  CG-KDly (k)  +  K  u<k> 

«  F  x<k)  +  G  y(k)  +  K  u<k). 

0  0 


Fig.  1  Different  representat ions  of  the  control  law. 
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If  the  system  (5>)  (6)  is  observable  the  matrix  K  can  always  be  chosen  so  that  F  = 

0 

F  —  KC  has  prescribed  eigenvalues  inside  the  unit  disc.  Notice  that  this  equation  i 
analogous  to  (4).  Applying  the  same  arguments  as  for  the  regulator  with  an  explici 
observer  the  control  law  becomes 


x (k+1 )  -  F  x  (k)  +  G  y(k)  +  K  u(k) 
0  0 


uCk)  -  sat  CCx(k)  +  D  y(k)l. 


(7) 


The  saturation  function  is  chosen  to  correspond  to  the  actual  saturation  in  the 
actuator.  A  comparison  with  the  case  of  an  explicit  observer  shows  that  (7)  corresponds 
to  an  observer  with  dynamics  given  by  the  matrix  F  .  The  system  (7)  is  of  course  also 

0 

equivalent  to  (2)  for  small  signals. 


IriDSffir_Fynct ien_Fgrm 

The  corresponding  constructions  can  also  be  carried  out  for  regulators  characterized 
by  input-output  models.  Consider  a  regulator  described  by 

R(q)  u(k)  »  T(q )  u  (k /  -  S(q)  y(k>  (8) 

c 

where  R>  S  and  T  are  polynomials  in  the  shift  operator.  The  problem  is  to  rewrite 
the  equation  so  that  it  looks  like  a  dynamical  system  with  the  observer  dynamics  driven 
by  three  inputs)  the  command  signal  u  «  the  process  output  y  and  the  control  signal  u. 

c 

This  is  accomplished  as  follows. 

Let  A  <q)  be  the  desired  characteristic  polynomial  of  the  observer.  Adding  A  (q)u(k) 
o  o 

to  both  sides  of  (8)  gives 

A  u  *  Tu  -  Sy  +  (A  -  R)  u 
o  e  o 

A  regulator  with  anti-windup  compensation  is  then  given  by 

A  v  ■  Tu  -  Sy  +  <A  -  R)  u 
o  c  o 

(V 

,u  »  sat  v. 

This  regulator  is  equivalent  to  (8)  when  it  does  not  saturate.  When  the  control 
variable  saturates  it  can  be  interpreted  as  an  observer  with  dynamics  given  by  the 
polynomial  A  . 

a 

# 

A  particularly  simple  case  is  the  case  of  a  dead  beat  observer  i.e.  A  =1.  The 

0 

model  can  then  be  written  as 

u(k)  “  sat  CT  <q  l)  u  <k>  -  S  <q  y(k>  ♦  (1-R*(q  *))  u(k)l  (10) 


5.  NUMERICS 

When  implementing  a  computer  control  system  it  is  necessary  to  answer  questions 
likei  How  accurate  converters  are  needed?  What  precisian  is  required  in  the  computations? 
Should  computations  be  made  in  fixed  point  or  floating  point  arithmetic?  To  answer  these 
questions  it  is  necessary  to  understand  the  effects  of  the  limitations  and  to  estimate 
their  consequences  for  the  closed  loop  system.  This  is  not  a  trivial  question)  because 
the  result  will  depend  on  a  complex  interaction  of  the  feedback)  the  algorithm  and  the 
sampling  rate.  The  real  issues  fortunately  involves  crude  questions  like  10  or  12  bit 
resolution)  24  or  32  bit  wordlength.  Such  questions  may  be  answered  using  simplified 
analysis.  A  detailed  treatment  is  given  in  CS1 . 


ElXQI-IfiUICIl 

The  major  error  sources  are 

-  Quantization  in  A-D  converters. 

-  Quantization  of  parameters. 

-  Round-off)  overflow)  and  underflow  in  addition)  subtraction)  mu  1 1 i pi icat ion) 
division)  function  evaluation  and  other  operations. 

-  Quantization  in  D-A  converters. 


rt  tfl 
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Common  types  of  A-D  converters  have  accuracies  of  8.  10>  12  and  14  bits  which 
corresponds  to  a  resolution  of  0.4  X.  0.1  Xt  0.02S  X  and  0.006  X.  The  percentages  are  in 
relation  to  full  scale.  The  D-A  converters  have  also  a  limited  precision.  An  accuracy  of 
10  bits  is  typical.  The  error  due  to  the  quantization  of  the  parameters  will  depend 
critically  on  the  sampling  period  and  on  the  chosen  realization  of  the  control  law. 


ug£g=lCQg&b 

Digital  control  algorithms  are  typically  implemented  on  micro  and  minicomputers 
which  have  word-lengths  of  8.  16  or  32  bits.  Special  purpose  computers  where  the 
word-length  may  be  chosen  freely  are  used  in  applications  like  the  space  shuttle  or  in 
special  products  which  are  made  in  very  large  quantities. 

There  are  many  differences  in  number  representations.  The  following  representations 
are  common. 


-  Fixed  point  single  precision  16  bit 

-  Fixed  point  double  precision  32  bit 

-  Floating  point  single  precision  8  bit  exponent  24  bit  mantissa 

-  Floating  point  single  precision  8  bit  exponent  56  bit  mantissa 

A  key  problem  is  that  floating  point  operations  are  neither  associative  nor 
distributive. 


Qveryiew_gf _effects_gf_ round-off _and_guanti zat ion 

An  overview  of  the  effects  of  round-off  and  quantization  will  now  be  given.  Tools 
for  analysing  the  effects  will  also  be  discussed. 

The  consequences  of  round-off  and  quantization  depend  on  the  feedback  system  and  on 
the  details  of  the  algorithm.  The  properties  may  be  influenced  considerably  by  changing 
the  representation  of  the  control  law  or  the  details  of  the  algorithm.  It  is  thus 
important  to  understand  the  phenomena. 

A  detailed  description  of  round-off  and  quantization  leads  to  a  complicated 
nonlinear  model  which  is  very  difficult  to  analyse.  Investigation  of  very  simple  cases 
shows.  however,  that  quantization  and  round-off  may  lead  to  limit  cycle  oscillations,  see 
£63  and  £73 . 

Some  properties  of  quantization  and  round-off  in  a  feedback  system  may  also  be 
captured  by  linear  analysis.  Quantization  and  round-off  are  then  modeled  as  ideal 
operations  with  additive  or  multiplicative  disturbances.  The  disturbance  may  be  either 
deterministic  or  stochastic.  This  type  of  analysis  is  particularly  useful  for  order  of 
magnitude  estimation.  It  allows  investigation  of  complex  systems  and  it  is  useful  when 
comparing  different  algorithms,  see  £83  and  £93. 

Techniques  from  sensitivity  analysis  and  numerical  analysis  are  also  useful  to  find 
the  sensitivity  of  algorithms  to  changes  of  parameters.  Such  methods  may  be  used  to 
compare  and  screen  different  algorithms.  The  methods  are.  however,  limited  to  comparison 
of  the  open-loop  performances  of  the  algorithms.  It  is  of  course  also  necessary  to 
compare  the  effects  of  quantization  and  round-off  with  the  other  disturbances  in  the 
system. 


A  control  law  is  a  dynamical  system.  Different  realizations  may  be  obtained  by 
transforming  the  state  space  coordinates.  The  choice  of  a  suitable  realization  is  very 
important  for  the  conditioning.  In  particular  the  companion  forms  are  very  bad  from  a 
numerical  point  of  view,  see  £103.  It  is  much  better  to  represent  a  system  as  a 
combination  of  first  and  second  order  systems. 

If  the  dynamical  system  representing  the  regulator  has  nr  distinct  real  poles  and  nc 
complex  pole  pairs  the  control  algorithm  may  be  transformed  to  the  model  form 


z  <k+l)  »  X  z  <k>  +  0  y<k)  i  ■  l....«nr 
i  i  i  i 


V  (k+1 )  - 

’  a 

i 

u 

i 

v  Ck)  + 

\l' 

i 

-u 

i 

a 

i. 

1 

.*12, 

y  <  k )  i  »  1 ..... nc 


(11) 


u<k)  ■  Dy(k)  +  E  yz(k)+  E  6v(k) 
i  i  i  i 

i-1  i-1 


where  the  complex  poles  are  represented  using  real  variables.  Notice  that  z  are 

i 

scalars  and  v  are  vectors  with  two  elements. 


To  avoid  numerical  difficulties  the  control  law  should  thus  be  transformed  into  the 
form  (6. 4)  which  is  then  implemented  in  the  control  computer.  The  transformation  may 
easily  be  done  in  a  package  for  computer  aided  design.  Notice  that  it  is  easy  to  use 
fixed  point  calculations  and  scaling  for  equations  in  the  form  (6.4). 

If  the  control  law  has  multiple  eigenvalues  a  Jordan  canonical  form  replaces  (6.4). 
An  eigenvalue  X  of  multiplicity  3  thus  corresponds  to  a  block 


X 

1 

0 

[M 

0 

X 

1 

z  (k  )  + 

62 

.0 

0 

X. 

6 

3J 
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The  sampling  period  also  has  a  considerable  influence  on  the  conditioning  as  is 
shown  by  the  following  examples. 


EXAMPLE  -  Effect  of  sampling  period  on  coefficient  precision 

Consider  a  first  order  system  with  time  constant  T.  The  discrete  time  equivalent  of  such 

a  system  is 

x<t+h)  “  axCt)  +  bu(t)> 

whe  re 


-h/T 

a  «  e 

Simple  calculations  show  that 

dT  T  da 

T~  hi"’ 


For  a  given  relative  precisian  in  the  equivalent  time  constant  is  thus  inverse 
proportional  to  the  sampling  period. 


6.  CONCLUSIONS 

Implementat ion  of  control  laws  using  a  computer  have  been  discussed  in  this  paper. 
The  key  problem  is  to  implement  a  discrete  time  system.  The  principles  for  doing  this 
have  been  covered  in  detail.  It  was  shown  that  it  is  straightforward  to  obtain  the  code 
from  the  control  algorithm.  The  importance  of  prefiltering  to  avoid  aliasing  has  been 
mentioned.  Nonlinear  digital  filtering  for  removing  outliers  has  also  been  discussed.  It 
has  been  mentioned  that  the  computational  delay  is  influenced  considerably  by  the 
organization  of  the  computer  code.  Difficulties  which  arise  from  saturation  in  actuators 
and  ways  to  avoid  the  difficulties  have  been  discussed.  This  will  also  automat ica 1 ly  give 
a  solution  to  mode  switching  and  initial ization.  Numerical  problems  and  consequences  of 
finite  word-length  have  also  been  discussed.  It  was  found  to  be  very  beneficial  to 
transform  the  equations  describing  the  control  law  to  a  form  which  is  numerically  well 
conditioned.  Although  the  presentation  is  kept  fairly  brief  the  information  given  should 
be  sufficient  to  implement  control  algorithms  on  mini  and  micro  computers  using  high 
level  languages. 
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14.Abstract 

This  Lecture  Series  is  intended  to  provide  the  basic  concepts,  theories  and  computer 
methods  involved  in  the  design  of  advanced  guidance  and  control  systems. 

The  degree  of  advantages  in  the  application  of  modem  microprocessor  technologies  is 
altready  largely  affected  by  the  way  corresponding  systems  are  designed  in  the  very  early 
stage  of  a  development  programme. 

It  is  intended  to  perform  a  comprehensive  review  of  direct  digital  analysis  and  synthesis 
procedures  and  to  include  in  this  Lecture  Series  computer-aided  and  graphical  techniques 
that  can  be  employed  in  preliminary  design,  synthesis  and  real-time  simulation. 


The  material  in  this  publication  was  assembled  to  support  a  Lecture  Series  under  the 
sponsorship  of  the  Guidance  and  Control  Panel  and  the  Consultant  and  Exchange 
Programme  of  AGARD. 
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It  is  intended  to  perform  a  comprehensive  review  of  direct  digital  analysis  and  It  is  intended  to  perform  a  comprehensive  review  of  direct  digital  analysis  and 
synthesis  procedures  and  to  include  in  this  Lecture  Series  computer-aided  and  graphical  synthesis  procedures  and  to  include  in  this  Lecture  Series  computer-aided  and  graphical 
techniques  that  can  be  employed  in  preliminary  design,  synthesis  and  real-time  techniques  that  can  be  employed  in  preliminary  design,  synthesis  and  real-time 
simulation.  simulation. 


